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TRANSLOCATION DEPENDENT COMPLEMENTATION FOR DRUG SCREENING 

Field of invention 

The present invention relates to various uses of complementing proteins protein 
5 fragments to discover chemical compounds or drugs interfering with protein translocation 
and/or protein-protein interactions. The invention takes advantage of the fact that many 
interacting proteins reside in separate and distinct locations prior to the activation of the 
signaling pathways in which they play their part. The invention also takes advantage of 
certain interaction domains that can be induced to interact by application of specific 
10 stimuli, by which means the complementation signal may be enhanced for those 
complementary components that become directed to the same intracellular location 
following activation of the signaling pathway in which they act. 

Background of the invention 

The interaction of proteins with each other and with other cellular components is an 
15 intrinsic part of nearly every cellular process, and this is especially true of intracellular 
signaling systems. Information is typically passed through and between signaling systems 
by a series of such interactions. 

Existing methods for identification of interacting species can be divided into two groups: 
First are those methods that can only work with more or less purified components brought 

20 together in vitro, such as surface plasmon resonance (evanescent wave methods), protein 
mass spectroscopy, fluorescence correlation spectroscopy and anisotropy measurements 
all with the common feature that the components of interest are isolated from the cellular 
context. The second group includes all methods designed to work within living cells. Of 
these, many have been developed to work in yeast cells (yeast two hybrid, reverse yeast 

25 two hybrid and variations thereof) but some have been adapted for use in mammalian cell 
systems. Cellular methods for detection of protein interactions have been well reviewed 
by Mendelsohn, A.R., Brent, R. (1999) (Science 284(5422): 1948). Many of these methods 
are descendants of the conventional two-hybrid methods, where transcriptional activity is 
initiated by the bringing together of bi-partite transcription factors through the interaction of 

30 attached "bait" and "prey" components, while other methods rely on reconstitution of a 
biochemical function in vivo. Rossi et al. (2000) (Trends in Cell Biology 10:119-122) have 
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thus developed a mammalian cell-based protein-protein interaction aissay where the read- 
out is not transcriptional but reconstitution of a mutated beta-galactosidase enzyme. Upon 
reconstitution of the tetrameric enzyme, enzymatic activity can be monitored. In addition 
methods for monitoring protein-protein interactions that are based on an optical read-out 
5 i.e. fluorescence resonance transfer (FRET) , or coincidence analysis (a variant of 
fluorescence correlation spectroscopy), or fluorescence lifetime changes. The last three 
categories are more normally applied under simplified in vitro conditions, but attempts are 
being made to move them into the more complex environment of the living cell. 

It has been suggested to use the reassembly of certain enzyme fragments of the 
1 0 complete enzyme as a measure of protein-protein interactions. Johnsson and Varshavsky 
(Johnsson, N., Varshavsky, A. (1994) Proc. Natl. Acad. Sci. U. S. A. 91, 10340-10344) 
disclose reassembly of Ubiquitin. This reassembly is detected through the irreversible 
cleavage of the fusion by Ubiquitin protease and release of a reporter. As opposed to the 
two-hybrid technique, this technique includes the possibility of monitoring a protein-protein 
1 5 interaction as a function of time, at the natural sites of this interaction in a living cell. 

Similar systems are suggested for the reassembly of other proteins including 0- 
galactosidase (Rossi, F., Charlton, C.A., Blau, H.M. (1997) Proc. Natl. Acad. Sci. U. S. A. 
94, 8405-8410), dihydrofolate reductase (DHFR, WO98/34120), and p-lactamase 
(Wehrman, T., Kleaveland, B., Her, J.H., Balint, R.F., Blau, H.M. (2002) Proc. Natl. Acad. 

20 Sci. U. S. A. 99, 3469-3474). The basic concept is that by splitting a functional protein in 
two fragments, the function is lost. The two fragments are transformed or transfected into 
cells fused in frame to proteins X and Y, respectively. Binding between proteins X and Y 
will bring the two fragments close together, increasing the local concentration of the 
complementing fragments, induce folding of these fragments and produce a functional 

25 protein with an activity that is similar to that of the non-fragmented protein. If the function 
is DHFR activity, the cells will survive only if proteins X and Y bind to each other. 

Recently, it has been described to use a somewhat similar system for the assisted 
reassembly and folding of fragments of fluorescent proteins. As the function is 
fluorescence, the cells will emit light upon excitation only if protein X and protein Y bind to 
30 each other thereby assisting complementation. Ghosh (I. Ghosh, A.D. Hamilton, L. Regan 
(2000) J. Am. Chem. Soc. 122, 5658-5659, WO01/87919) describes the use of a GFP 
variant called sg100 (F64L, S65C, Q80R, Y151L, I167T and K238N). This GFP has single 
fluorescence excitation and emission peaks at 475 nm and 505 nm, respectively (similar 
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to sg25 described by Palm (Palm, G.J., Zdanov, A., Gaitanaris, G.A., Stauber, R., 
Pavlakis, G.N., Wlodawer, A. (1997) Nat. Struct Biol. 4, 361-365)). Functional GFP 
fragment complementation is accomplished by co-expressing two independent peptides 
composed of the first 157 N-terminal amino acids of this GFP (NtermGFP157) and the 
5 remaining 81 C-terminal amino acids (starting form residue 158) of this GFP 
(CtermGFP158) with each of the GFP peptide fragments being fused to interacting 
leucine zipper peptides that serve to associate the fragments. 

The recruitment or re-localisation of proteins plays a major role in many key signalling 
10 systems. This is evident in (i) the activation of protein kinase C (PKC), where recruitment 
to the plasma membrane is an inherent part of the activation process of this enzyme; (ii) 
the activation of p42/44 MAP Kinase depends on multiple proteins whose transfer from 
cyfosolic to membrane location has a central role and (iii) the cAMP-driven relocalisation 
of rapl is crucial to its activation. The use of protein translocation for identification of 
15 potential medicaments is disclosed in WO98/45704. However, there is still a need for 
alternative techniques for detecting the localisation and any re-localisation of proteins in 
intact, living cells. Current detection techniques are generally characterised by low 
throughputs and requirements for expensive detection equipments. 

Summary of the invention 

20 The present application discloses that certain GFPs can be reassembled and form a 
functional fluorescent protein when expressed as two independent proteins halves. For 
. example, when EGFP is expressed in mammalian cells, choosing a split site located in a 
loop region between the residues that form the beta-sheet structures of the GFP beta- 
barrel results in intense fluorescence (Example 5 and Example 7). The present application 

25 further illustrates that EYFP is also reassembled and, surprisingly, the fluorescence from 
the reassembled protein is markedly enhanced if it contains the F64L mutation (Example 
9). 

The present application demonstrates that a model GFP complementation system using 
components which can be made to interact conditionally does respond as expected in a 
30 dose-dependent manner to the interaction stimulus, and, importantly, can be used to 
detect compounds that will block the conditional interaction of proteins fused to the 
complementary halves of a protein (e.g. a GFP complementation system) (Example 11) 
as almost no spontaneous reassembly (complementation) is observed. 
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The examples herein disclose that for spontaneous interacting proteins, a 
complementation system can be designed to report on the degree of interaction between 
those proteins in living cells, with the option to add compounds before such interaction 
takes place. This is achieved by holding the two proteins in separate locations of the cell 
5 thereby only allowing interaction to take place after translocation of one or both proteins to 
bring them together in one compartment or location within the cell has taken place 
(Example 16). 

One aspect of the present invention builds on the concept that signaling proteins which 
will interact with each other when active often reside in distinct and spatially separated 

1 0 intracellular locations prior to the activation of the signaling pathway in which they act, 
thus avoiding spontaneous reassembly of the complementary system used to monitor the 
interaction of those proteins. This is clearly illustrated in Example 14, where induction of 
translocation from one localisation to another causes the N~ and C- terminal of the 
complementation proteins to reassemble and exert the function (in this case 

15 fluorescence). A similar system is described in Example 16 where constitutive interacting 
proteins has been replaced with proteins of induced interaction. 

Detailed disclosure 

The present invention builds on the knowledge of localisation of proteins inside the cell 
and using this to keep the two half s of an enzyme apart until in a controlled fashion they 
20 are brought back together. Thus, a major embodiment of the invention relates to a cell 
comprising at least: 

- a first conjugate comprising a first protein and the N-terminal fragment of 

complementation protein; and 

- a second conjugate comprising a second protein and the C-terminal fragment of the 
25 complementation protein; 

wherein said first conjugate has a predominant cellular location that is distinct and 
spatially separated from where said second conjugate is predominantly located, 

wherein said first protein and said second protein will bind to each other when conditions 
allow, and 

30 wherein said complementation protein exhibits altered characteristics when the two 

fragments of the complementation protein are brought into close apposition and the 
two fragments of the complementation protein form the full functional protein. 
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One use of the invention is to be able to identify drugs that will prevent the interaction 
between the first protein and the second protein. To avoid interaction before such drug 
has been applied, one important feature of the invention is that the two conjugates are 
5 held in different cellular locations. Most, if not all, proteins are in equilibrium between 
various states. In the present context, this equilibrium will represent the balance between 
being in one location and being in another location within the cell. For the purpose of the 
present application, two different cellular locations are understood as predominant or 
preferred positions of the two proteins making interaction between the two proteins 

10 unlikely, that is, the conditions for their meeting are not favored. In the cases where the 
balance between two locations is shifted far to one side, the protein will predominantly be 
in one of these locations. Activation of the system here means the result of any treatment 
of the system that initiates translocation, or any other cellular signaling process, that 
makes complementation of the two conjugates more favorable - that is, activation of the 

15 system shifts the equilibrium of at least one of the proteins from a different cellular 
location from the other protein, to the same cellular location as the other protein. As an 
example, PKCp proteins are found predominantly, but not exclusively, in the cytoplasm in 
unstimulated cells, whereas the interaction partner protein, RACK1, is predominantly 
located at the plasma membrane. Upon application of a stimulus such as the phorbol 

20 ester PMA, PKCp proteins take up a location predominantly at the plasma membrane, 
where they are therefore brought into close proximity to RACK1 . Another example of 
signaling proteins that translocate to and from the plasma membrane are STAT 
transcription factors and AKT/PBK kinase. Examples of signaling proteins that translocate 
between the cytoplasm and the nucleus are NF kappaB transcription factor, the androgen 

25 receptor, the MAP kinases p38, ERK and JNK, MAPKAP kinase 2 and Forkhead 
transcription factor. 

Thus for most uses of the invention, the cellular location of the first conjugate is 
determined by the nature of the first protein. Likewise, the cellular location of the second 
conjugate is determined by the nature of the second protein. 

30 

The presently known complementation proteins causes irreversible complementation 
because the correctly folded complexes of the complementing fragments are energetically 
very favourable, i.e. the folded fragments dissociate only very slowly if at all. Thus, if the 
complementation starts as soon as the proteins are in the same location, due to 
35 spontaneous binding between the first protein in the first conjugate and the second protein 
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in the second conjugate, the small fraction of protein in the not favored location will be 
irreversibly bound. When irreversibly bound, it no longer participates in the equilibrium of 
free protein conjugate between the unfavored and favored locations and inevitably more 
of the protein conjugate can go to the unfavored location. Ultimately, most or all of the 
5 protein conjugate can be drawn by the equilibrium to the unfavored location. This is 
referred to a sink effect. One way to minimize the spontaneous meeting between the two 
proteins, is to associate one of the proteins with an anchor protein that will severely limit 
its mobility in the cell. In this case the one protein is fixed in one localisation, and the 
meeting between the complementing fragments will only take place if the other protein is 
1 0 moving to the anchored position. 

In this aspect of the invention, the second conjugate further comprises an anchor protein, 
wherein said anchor protein is anchored in a different cellular compartment from where 
said first protein is predominantly located. 

1 5 Another use of the location information wherein constitutively interacting proteins are kept 
apart until the two conjugates are purposely brought together is to measure translocation 
of a protein. This can be done in a cell comprising at least: 
- a first conjugate comprising a protein, interaction partner A and the first terminal 
fragment of a complementation protein; and 
20 - a second conjugate comprising a interaction partner B and the second terminal fragment 
of the complementation protein; 
wherein said first conjugate has a predominant cellular location that is distinct and 
spatially separated from where said second conjugate is predominantly located, 
wherein said complementation protein exhibits altered characteristics when the two 
25 fragments of the complementation protein are brought into close apposition and the 
two fragments of the complementation protein form the full functional protein and 
wherein said interaction partner A and interaction partner B bind to each other. 

In such cell, translocation of the protein, and thereby the first conjugate, to the same 
cellular location as the second conjugate, will cause complementation of the first and 
30 second terminal fragments of the complementation protein. 



One way to minimize the effect of spontaneous complementation between the two 
proteins (the sink effect, which generates an unwanted background signal), is to include in 
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the conjugates interaction partners A and B that will only bind when a specific interaction 
stimulus has been applied (conditional interaction partners). In this way the spontaneous 
meeting between the proteins will only cause complementation of the two 
complementation fragments when the interaction stimulus causes the interaction partners 
5 to bind and thereby bring the complementation fragments in close apposition for the . 
reassembly and formation of a functional protein. In this embodiment interaction partner A 
and interaction partner B bind to each other only when an interaction stimulus has been 
applied. 

To obtain further control of the location of the conjugates and minimize the spontaneous 
10 meeting between the two conjugates, is to associate one of the conjugates with an anchor 
protein that will severely limit its mobility in the cell. In this case the one protein is fixed in 
one localisation, and the meeting between the complementing fragments will only take 
place if the other protein is moving to the anchored position. In this aspect of the 
invention, the second conjugate further comprises an anchor protein, wherein said anchor 
15 protein is anchored in a different cellular compartment from where said first protein is 
predominantly located. 

In one embodiment of the invention the first terminal fragment of the complementation 
protein is the N-terminal fragment of the complementation protein and the second terminal 
20 fragment of the complementation protein is the C-terminal fragment of the 

complementation protein. In another embodiment of the invention the first terminal 
fragment of the complementation protein is the C-terminal fragment of the 
complementation protein and the second terminal fragment of the complementation 
protein is the N-terminal fragment of the complementation protein. 

25 The simplicity of the present invention is in part derived from the complementation protein. 
The fact that the complementation protein exhibits altered characteristics when the two 
fragments of the complementation protein are brought into close apposition and the two 
fragments of the complementation protein form the full functional protein, makes detection 
of the close apposition between the two fragments of the complementation protein easy. 

30 In principle any detectable property of a combined mass that is measurably different to 
that of the separate constituent parts may be used as a signal of complementation of 
those separate parts. Such properties include, but are not limited to, biophysical 
properties such as rotational or translational diffusion coefficients, fluorescence properties 
such as peak excitation or emission wavelengths, or fluorescence lifetimes. Potentially 
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useful detectable properties also include those of a more biochemical nature, such as the 
acquisition of catalytic efficiency or enzymic activity, or the efficacy of conversion of a 
polypeptide sequence inherent to one of the complementary partners to a form that 
acquires the ability to fluoresce or luminesce. Examples of proteins that can be made to 
5 acquire catalytic activity when the constituent parts are assembled through 

complementation are Ubiquitin (Johnsson, N., Varshavsky, A. (1994)) p-galactosidase 
(Rossi, F et al (1 997)) and p-lactamase (Wehrman, et al (2002)). 

Below it is described in detail how GFP can be divided to form two complementing 
fragments, as such data are not available on the priority date of the present invention. 

1 0 However, as will be evident to the skilled person, other proteins can be divided as well. 
The non-fluorescent fragments of fluorescent proteins that can be combined to form one 
functional fluorescent unit are usually produced by splitting the coding nucleotide 
sequence of one fluorescent protein at an appropriate site and expressing each 
nucleotide sequence fragment independently (as fusion proteins) as the N-terminal 

15 fragment and the C-terminal fragment of the complementation protein (here GFP). 

In one embodiment of the invention, the complementation protein is a fluorescent protein 
such as a Green Fluorescent Protein (GFP). 

Green Fluorescent Protein (GFP) is a 238 amino acid long protein derived from the 
20 jellyfish Aequdrea Victoria (SEQ ID NO: 1). However, fluorescent proteins have also been 
isolated from other members of the Coelenterata, such as the red fluorescent protein from 
Discosoma sp. (Matz, M.V. etaL 1999, Nature Biotechnology 17: 969-973), GFP from 
Renilla reniformis, GFP from Renilla Muelleri or fluorescent proteins from other animals, 
fungi or plants. The GFP exists in various modified forms including the blue fluorescent 
25 variant of GFP (BFP) disclosed by Heim et al. (Heim, R. et a/., 1994, Proc.Natl.Acad.Sci. 
91 :26, pp 12501-12504) which is a Y66H variant of wild type GFP; the yellow fluorescent 
variant of GFP (YFP) with the S65G, S72A, and T203Y mutations ( WO98/06737); the cyan 
fluorescent variant of GFP (CFP) with the Y66W colour mutation and optionally the F64L, 
S65T, N1461, M153T, V163A folding/solubility mutations (Heim, R., Tsien, R.Y. (1996) Curr. 
30 Biol. 6, 178-182). The most widely used variant of GFP is EGFP with the F64L and S65T 
mutations (WO 97/1 1 094 and W096/2381 0) and insertion of one valine residue after the 
first Met. The F64L mutation is the amino acid in position 1 upstream from the chromophore. 
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GFP containing this folding mutation provides an increase in fluorescence intensity when the 
GFP is expressed in cells at a temperature above about 30°C (WO 97/1 1094). 

It is known that fluorescence in wild-type GFP is due to the presence of a chromophore, 
which is generated by cyclisation and oxidation of the SYG at position 65-67 in the predicted 
5 primary amino acid sequence and presumably by the same reasoning of the SHG sequence 
in other GFP analogues at positions 65-67. 

The present examples clearly illustrate how the fluorescence intensity from a reassembled 
protein is enhanced in GFPs containing the F64L mutations as compared to GFPs without 
this mutation. Thus, it is preferred that the GFP contains the F64L mutation, either by 
1 0 electing a GFP with this mutation (e.g. EGFP) or to introduce this mutation into the GFP of 
choice (e.g. YFP as illustrated in Example 9). 

In the nomenclature of GFP, an "E" is placed in front of the GFP (EGFP, EYFP, ECFP) to 
indicate that this particular GFP is encoded by a nucleic acid with codon usage optimised for 
mammalian cells. Most of these proteins also have an extra valine residue inserted after the 
15 initial methionine residue, Met 1 . This extra valine residue is not considered in the numbering 
of the residues. Thus, in a preferred embodiment, the GFP of the present invention is 
selected from the group consisting of EGFP, EYFP, ECFP, dsRed and Renilla GFP. 

Some of the examples of the present application, EGFP is used, thus, in a preferred 
embodiment of the invention, the GFP is EGFP. However,. Example 9 and Example 11 
20 show that EYFP has certain advantages. Thus, in another preferred embodiment of the 
invention, the GFP is EYFP. It is also shown that EYFP mutated in position 1 preceding 
the chromophore (E[F64L]YFP) has specific advantages. Thus, in a preferred 
embodiment the GFP is E[F64L]YFP, or the GFP is E[S72A]GFP (that is F64L, S65T, 
S72A GFP). 

25 

One aspect of the invention relates to a cell as described above, wherein N-terminal 
fragment of the complementation protein is an N-terminal fragment of GFP, comprising a 
continuous stretch of amino acids from amino acid number 1 to amino acid number X of 
GFP, wherein the peptide bond between amino acid number X and amino acid number 
30 X+1 is within a loop of GFP and C-terminal fragment of the complementation protein is a 
C-terminal fragment of GFP, comprising a continuous stretch of amino acids from amino 
acid number X+1 to amino acid number 238 of GFP. 
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In the present context, the numbering of wild-type GFP (SEQ ID NO: 1) (Chaifie, M., Tu f 
Y. f Euskirchen, G., Ward, W.W., Prasher, D.C. (1994) Science 263, 802-805, this variant 
of GFP has a histidine residue in position 231) is used. Based on the crystal structure of 
GFP (Yang, F., Moss, LG., Phillips, G.N. (1996) Nat. Biotech. 14, 1246-1251) Figure 5, 
5 Table 1 and the data presented in the examples, it is evident that a split in almost any 
loop will be re-assembled following appropriate spatial approximation to the 
complementation fragments assisted by the interaction of the conjugated proteins. For the 
purpose of this application the term "loop" shall be understood as a turn or element of 
irregular secondary structure. 

10 Thus, in one aspect, the complementation protein is two GFP fragments 

wherein X is 7, 8, 1 1 or 12, preferably X is 9 or 10 within the Thr9-Val1 1 loop; or 
wherein X is 21 , 22, 25 or 26, preferably X is 23 or 24 within the Asn23-His25 loop; or 
wherein X is 36, 37, 40 or 41, preferably X is 38 or 39 within the Thr38-Gly40 loop; or 
wherein X is 46, 47, 56 or 57, preferably X is between 48 and 55 i.e. X is 48, 49, 50, 51, 

15 52, 53, 54 or 55 within the Cys48-Pro56 loop; or 

wherein X is 70, 71, 76 or 77, preferably X is between 72 and 75 i.e. X is 72, 73, 74 or 75 

within the Ser72-Asp76 loop; or 
wherein X is 79, 80, 83 or 84, preferably X is 81 or 82 within the His81-Phe83 loop; or 
wherein X is 86, 87, 90 or 91 , preferably X is 88 or 89 within the Met88-Glu90 loop; or 

20 wherein X is 99, 100, 103 or 104, preferably X is 101 or 102 within the Lys101-Asp103 
loop; or 

wherein X is 112, 113, 118 or 119, preferably X is between 114 and 117 i.e. X is 114, 115, 

1 16 or 1 17 within the Phe1 14-Thr1 18 loop; or 
wherein X is 126, 127, 145 or 146, preferably X is between 128 and 144 i.e. X is 128, 129, 
25 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144 within the 

He 128-Tyr145 loop; or 
wherein X is 152, 153, 160 or 161, preferably X is between 154 and 159 i.e. X is 154, 155, 

156, 157, 158 or 159 within the Ala154-Gly160 loop; or 
wherein X is 169, 170, 175, 176, preferably X is between 171 and 174 i.e. X is 171, 172, 
30 173 or 174 within the Ile171-Ser175 loop; or 

wherein X is 186, 187, 197 or 198, preferably X is between 188 and 196 i.e. X is 188, 189, 

190, 191, 192, 193, 194, 195 or 196 within the Ile188-Asp197 loop; or 
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wherein X is 208, 209, 215 or 216, preferably X is between 210 and 214 i.e. X is 210, 211, 
212, 213 or 214 within the Asp210-Art215 loop. 

Table 1 GFP secondary structures, GFP wild type sequence amino acid numbering, a 
and p indicate a-helical and p-sheet secondary structures, respectively. 



Name 


Position 




Helix 1 


Lys3 - Thr9 


a1 


Sheet 1 


Val11 -Asn23 


01 


Sheet 2 


His25 - Thr38 


B2 


Sheet 3 


Gly40 - Cys48 


63 


Helix 2 


Pro56 - Ser72 


a2 


Helix 3 


Asp76 - His81 


a3 


Helix 4 


Phe83 - Met88 


a4 


Sheet 4 


Glu90- Lys 101 


P4 


Sheet 5 


Asp103-Phe114 


p5 


Sheet 6 


Thr118-lle128 


P6 


Sheet 7 


Tyr145-Ala154 


P7 


Sheet 8 


Gly160-lle171 


p8 


Sheet 9 


Ser175- Ile188 


p9 


Sheet 10 


Asp197- Asp210 


pio 


Sheet 1 1 


Arg215-Gly228 


P11 



5 



Based on the findings disclosed in the examples, it is concluded that appropriate splitting 
sites in GFP are located in the loop regions between the residues that form the beta-sheet 
structures of the GFP beta-barrel. Accordingly, splits in GFP are preferably made in the 
Asn23-His25 loop, the Thr38-Gly40 loop, the Lys101-Asp102 loop, the Phe114-Thr118 
10 loop, the Ile128-Tyr145 loop, the Ala154-Gly160 loop, the Ile171-Ser175 loop, the Ile188- 
Asp197 loop or the Asp210-Arg215 loop (Table 1, Figure 5). 

The data in the present examples illustrates clearly that the Ala154-Gly160 loop is very 
well suited for GFP reassembly. This is particularly the case when the GFP is divided 
between amino acids Q157 and K158 (that is, when X is 157). Thus, a preferred 
15 embodiment of the invention relates to two GFP fragments, wherein X is 157 within the 
Ala154-Gly160 loop. 
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The data in the present examples also illustrate that the Iie171-Serl75 loop is very useful 
for GFP reassembly. This is particularly the case, when the GFP is divided between 
amino acids E172 and D173 (that is, when X is 172). Thus, a preferred embodiment of the 
invention relates to two GFP fragments, wherein X is 172 within the Ile171-Ser175 loop. 

5 

One alterative aspect of the invention relates to the cell described above, wherein the N- 
terminal fragment of the complementation protein is an N-terminal fragment of GFP, 
comprising a continuous stretch of amino acids from amino acid number 1 to amino acid 
number X of GFP, wherein the peptide bond between amino acid number X and amino 
10 acid number X+1 is within a loop of GFP and the C-terminal of the complementation 
protein is a C-terminal fragment of GFP, comprising a continuous stretch of amino acids 
from amino acid number Y+1 to amino acid number 238 of GFP, wherein Y<X creating an 
overlap of the two GFP fragments, and wherein the peptide bond between amino acid Y 
and amino acid Y+1 is within a loop of GFP. 

15 As illustrated in Example 9, fragments having overlapping sequences have certain 
advantages. Thus one aspect of the invention relates to two GFP fragments comprising 
(a) an N-terminal fragment of GFP, comprising a continuous stretch of amino acids from 
amino acid number 1 to amino acid number X of GFP, wherein the peptide bond between 
amino acid number X and amino acid number X+1 is within a loop of GFP and 
. 20 (b) a C-terminal fragment of GFP, comprising a continuous stretch of amino acids from 
amino acid number Y+1 to amino acid number 238 of GFP, wherein Y<X creating an . 
overlap of the two GFP fragments, and wherein the peptide bond between amino acid Y 
and amino acid Y+1 is within a loop of GFP. 

These overlapping GFP fragments are very attractive in e.g. functional cloning systems 
25 where highly flexible linkers sequences are required due to the very diverse nature of the 
structures of fusion partners. The overlapping fragments, permit either of the fusion 
partners to have a long linker sequence. 

For the purposes of deciding the nature of the Y in the C-terminal fragment of GFP 
defined above, the same considerations as discussed for the value of X applies. 



30 



In one embodiment of the invention the overlap is just a few amino acid residues, e.g. X-Y 
=1, X-Y=2, X-Y=3, X-Y=4, X-Y=5, X-Y=6, X-Y=7, X-Y=8, X-Y=9 orX-Y=10. 
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Due to the folding characteristics of the folding of GFP, a preferred embodiment of the 
invention relates to overlapping N-terminal and C-terminal fragments of GFP wherein the 
peptide bond between amino acid Y and amino acid Y+1 and the peptide bond between 
amino acid X and amino acid X+1 is within a loop of GFP. The thereby obtained overlap is 
5 an entire a-helix or £-sheet secondary structure 

In a typical embodiment of the invention, the conjugated protein as described above, 
wherein the N-terminal fragment of the complementation protein (the first terminal 
fragment of the complementation protein) is conjugated to a protein of interest, further 
1 0 comprises a linker sequence between the fragment of the complementation protein and 
the corresponding protein of interest 

The length and composition of the linker must be chosen depending on the protein that 
one want conjugated to a the complementation protein fragment. The main function of a 
linker is to provide flexibility to avoid steric hindrance of the complementation. The 

15 interacting proteins should be free to interact and at the same time the complementing 
fragments should be free to fold. This is facilitated by long linker sequences, e.g. linkers of 
20 or more residues. However short linker sequences will keep the complementation 
protein fragments closer to each other, i.e. increase the local concentration of . 
complementing fragments, and decrease the entropy cost of complementation. Access to 

20 information about the structures of the interacting proteins or homologous proteins and to 
information about the interaction between the interacting proteins can be valuable 
because it may facilitate the design of short linkers of 5-1 0 residues that do not distort the 
protein interaction or complementation. It will often be advantageous to fuse the 
complementation protein fragments to the termini in the interacting protein that are 

25 brought closest together during the interaction. However, even when linker sequences of 
15-20 residues are used and the termini of the interacting proteins are more than 40 A 
apart, excellent complementation has been observed. 

The linkers should primarily be composed of amino acid residues that are small, 
hydrophilic, are unreactive and constitutes poor protease cleavage sites, e.g. Ser, Thr, 
30 Gly. 

A preferred embodiment of the invention relates to a nucleic acid encoding any of the 
conjugates, fragments or fusions proteins described above. In one embodiment, the 
nucleic acid construct encoding any of the proteins according to the invention described 
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above is a DNA construct. In another embodiment, the nucleic acid construct encoding 
any of the proteins according to the invention described above is a RNA construct. 

As will be evident to the skilled person, the conjugates can be assembled by choice, and 
5 the protein is conjugated to the first terminal of the complementation protein in the N- 
terminal or in the C-terminal. However, as illustrated in the examples, conjugation of the 
first protein of interest to the N-terminal fragment of GFP shall preferably be to the C- 
terminal of the N-terminal fragment of GFP. Likewise, conjugation of the second protein of 
interest to the C-terminal fragment of GFP shall preferably be to the N-terminal of the C- 
10 terminal fragment of GFP. 

As will be evident from the present examples the protein of interest is a protein, a peptide 
or a non-proteinaceous partner. 

Thus, one embodiment of the present invention relates to a cell wherein the N-terminal 
fragment of the complementation protein is fused in frame with the first protein of interest. 

15 One embodiment of the present invention relates to a cell, wherein the first protein is 
fused to the N-terminal of the N-terminal fragment of the complementation protein. 

One embodiment of the present invention relates to a cell, wherein the first protein is 
fused to the C-terminal of the N-terminal fragment of the complementation protein. 

One embodiment of the present invention relates to a cell, wherein the C-terminal 
20 fragment of the complementation protein is fused in frame with the second protein. 

One embodiment of the present invention relates to a cell, wherein the second protein of 
interest is fused to the N-terminal of the C-terminal fragment of the complementation 
protein. 

One embodiment of the present invention relates to a cell, wherein the second protein of 
25 interest is fused to the C-terminal of the C-terminal fragment of the complementation 
protein. 



One embodiment of the present invention relates to a cell, wherein the N-terminal 
fragment of the complementation protein fused in frame to a first protein further comprises 
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a linker sequence between the N-terminal fragment of the complementation protein and 
the first protein. 

One embodiment of the present invention relates to a cell, wherein the C-terminal 
fragment of the complementation protein fused in frame to a second protein further 
5 comprises a linker sequence between the C-terminal fragment of the complementation 
protein and the second protein. 

One embodiment of the present invention relates to a cell, wherein the GFP is EYFP 
further containing an F64L mutation, wherein X is 172, wherein the first protein fused to 
the N-terminal fragment of GFP is fused to the C-terminal of the N-terminal fragment, of 
10 GFP and wherein the second protein fused to the C-terminal fragment of GFP is fused to 
the N-terminal of the C-terminal fragment of GFP. 

One embodiment of the present invention relates to a cell, wherein the GFP is EYFP 
further containing an F64L mutation, wherein X is 157, wherein the first protein fused to 
the N-terminal fragment of GFP is fused to the C-terminal of the N-terminal fragment of 
15 GFP and wherein the second protein fused to the C-terminal fragment of GFP is fused to 
the N-terminal of the C-terminal fragment of GFP. 

It is not a prerequisite that the conjugates are expressed within the cell, however, it is 
preferred that the first conjugate is expressed in the cell. Likewise, in a preferred 
20 embodiment the second conjugate is expressed in the cell. 

In a series of embodiments of the present invention at least one of the conjugates will 
contain an anchor protein. In the present context, an anchor protein means any and all 
cellular components that have a defined cellular distribution, preferably genetically 
♦ 25 encodable cellular components. 

It is important, when selecting the anchor, that the nature of the system is taken into 
account. As an example, some proteins normally require to be phosphorylated or 
dephosphorylated by enzymes sequestered in the plane of the plasma membrane - for 
such proteins of interest it is appropriate to choose an anchor component that would be 
30 expected to be confined to the plasma membrane, to allow the interacting proteins to be 
appropriately modified when measuring protein-protein interactions, or to allow 
measurement of translocation to that location. Thus, in one embodiment, a preferred 
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anchor component that will target the conjugate to the plasma membrane is a protein 
containing the transmembrane domain of the epidermal growth factor receptor (EGFR), or 
containing the transmembrane domain of a protein from the integrin protein family, or 
containing the myristoylation sequence from c-Src (residues 1-14). 

5 In another embodiment, a histone protein is used as the anchor, or a protein normally 
restricted to nucleoli, for example the p120 nucleolar protein, in order to direct the 
anchoring conjugate to the nucleus, thereby allowing measurement of protein-protein 
interactions in the nucleus, or measurement of translocation to the nucleus. 

In another embodiment, the anchor protein is chosen from those proteins normally 
10 confined to mitochondrial outer or inner membranes for example VDAC, F 0 subunit of 
ATP-ase, or NADH dehydrogenase. In another embodiment, the anchor protein is chosen 
from the group of proteins normally confined to the various different regions of Golgi 
bodies for example TGN38 or ADAM12-L. In another embodiment, the anchor protein is 
chosen from the group of proteins normally confined to focal adhesion complexes for 
15 example P125, FAK, integerin alpha or beta, or paxillin. In another embodiment, the 
anchor protein is chosen from the group of proteins normally associated with cytoskeletal 
structures such as F-actin strands or micro tubular bundles for example MAP4, actin 
binding domain of alpha-actinin (actinPaint), kinesins, myosins or dyniens. 

In another embodiment the anchor protein is a protein. normally associated with nuclear 
20 material or nuclear components, such as histone proteins, including histones H1 , H2A, 
H2B, H3, H4, and variants thereof. 

In another embodiment the anchor protein is a protein normally associated with nuclear 
membrane such as A and B type lamins, or associated with nuclear bodies such as 
splicing bodies, Cajal bodies, PML nuclear bodies (PML oncogenic domains, PODS), or 
25 transcription engines such as RNA polymerase POL-II. 

As mentioned, some configurations of the present invention requires spontaneous 
interaction proteins. Examples of protein domains that may be used to bring about 
spontaneous interaction are leuzine zippers and coiled-coil domains generally. These 
30 domains have been identified in many different proteins, and the properties of self- 
association predicted with high accuracy from linear sequences of amino acids. 
Examples of proteins containing leucine zippers are c-Jun and c-Fos. Coiled coil domains 



WO 03/089627 




PCT/DK03/00266 



17 



and other self interacting domains are found in the Rho kinase ROCK, Bcr, Bcr-Abl, all 
known AKAPs and in the regulatory subunits PKA. Other spontaneous binding proteins 
are the binding of JNK to the delta region in c-Jun and the binding of beta-catenin to Tcf-4 
transciption factor. 

5 

The particular utility of stimulus-induced interactions is that in one and the same cell it is 
possible to switch on a distinctive interaction where previously there was only a very weak 
interaction or no interaction. This ensures that, in advance, the distinctive interaction is 
purely a result of specific interaction between the two interaction partners. In the case of 
10 conjugates that include a specific anchor protein that locates the fixed conjugate to a 
specific cellular location, use of a stimulus-induced interaction also guarantees that this 
interaction will give a signal that is measurable by the assay equipment configured to 
detect the expected location of the complementation product. 

In the embodiments using the interaction partners A and B, the specificity and usefulness 
15 of those aspects will depend on interaction partner A and interaction partner B having no 
measurable affinity for each other in the absence of the interaction stimulus. In one 
embodiment therefore, interactors A and B are chosen in appropriate pairs among the 
proteins targeted by immunosuppressants such as, but not limited to, cyclosporin A, 
Rapamycin and FK506. These proteins include, but are not limited to, FKBP12, FRAP and 
20 cyclophilin. In a preferred embodiment, interactors A and B are represented by FKBP12 
and a mutated fragment of FRAP (FRB T2098L, ARIAD Pharmaceuticals) and the 
interaction stimulus this interactor pair is represented by the rapalog AP21967 (ARIAD 
Pharmaceuticals). 

An example of suitable combinations of interaction protein A and interaction protein B and 
25 interaction stimulus are listed below: 



Interaction 


Interaction 


Interaction stimulus. 


partner A 


partner B 




FKBP12 


FRAP 


Rapamycin 


FKBP12 


FRB (T2098L) 


AP21967 



In another embodiment, the ligand-binding domain of a steroid hormone receptor such as, 
but not limited to, the estrogen receptor is used as both interactor A and B. Such a ligand- 
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binding domain will homo-dimerize upon addition of its cognate hormone ligand (in this 
case estrogen) (see Figure 22). 

In another embodiment, the full-length or the ligand-binding domain of a steroid hormone 
receptor is chosen as interactor A whereas interactor B is chosen among the family of 
5 steroid hormone receptor co-activators including, but not limited to, SRC-1 , GRIP-1 , 
ACTR, AIB-1. As above, cognate hormone is used as interaction stimulus. 

One major advantage of the present system is the ability to measure protein-protein 
interactions as a simple change in light intensity. Thus, one aspect of the invention relates 
10 to a method for detecting protein-protein interactions comprising the steps of: 

(a) providing a cell comprising at least: 

- a first conjugate comprising the first protein of interest and a first terminal 
complementation protein; and 

- a second conjugate comprising the second protein of interest and the second terminal 
15 complementation protein; 

wherein said first conjugate has a predominant cellular location that is distinct and 
spatially separated from where said second conjugate is predominantly located, 

(b) determine if complementation partner A and complementation partner B has 
complemented; 

20 complementation between the first and second terminal complemenation protein being 
indicative that the first protein of interest has translocated from the predominant location 
of this first protein of interest to the cellular location where the second protein of interest is 
located and that the two proteins of interest interact (Figure 21 ). 

In the typical use of this method for detecting protein-protein interactions, before step (b) a 
25 stimulus know to cause translocation of the first conjugate is applied. This stimulus could 
be stress, light, a compound or other means. Also, but before adding the translocation 
stimulus, a compound to be tested is added. If this compound is capable of preventing 
complementation, it is capable of preventing the interaction between the two proteins of 
interest, of preventing the complementation as such, or of preventing the translocation of 
30 the first conjugate. 
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An example of this method is to determine the interaction between receptors 
(dimerization) by having the first protein as one receptor and the second protein as the 
other receptor (optionally the same receptor: homo-dimerization). 

As mentioned above, in a typical configuration of such assay, the second conjugate 
5 further comprises an anchor protein (as described in detail above), wherein said anchor 
protein is anchored in a different cellular location from where said first protein of interest is 
predominantly located. 

Another advantage of the present invention is that it allows translocation to be measured 
10 as a change in light intensity. Thus, an aspect of the present invention relates to a method 
for detecting translocation of a protein comprising the steps of: 
(a) providing a cell comprising at least: 

- a first conjugate comprising said protein, a first terminal of a complementation protein, 
and interaction partner A; and 
15 - a second conjugate comprising an anchor protein, the second terminal of the 
complementation protein, and interaction partner B; 

wherein said first conjugate has a predominant cellular location that is distinct and 
spatially separated from where said second conjugate is predominantly located 
and wherein said interaction partner A and interaction partner B bind to each other; 
20 (b) determine if the terminals of the complementation protein has reassembled; 
reassembly (complementation) between the two terminals being indicative that said 
protein has translocated from the predominant location of the protein to the cellular 
location where the anchor protein is anchored (Figure 21). 

In the typical use of this method for detecting translocation of a protein, before step (b) a 
25 stimulus know to cause translocation of the first conjugate is applied. This stimulus could 
be stress, light, a compound or other means. Also, but before adding the translocation 
stimulus, a compound to be tested is added. If this compound is capable of preventing 
complementation, it is capable of preventing the interaction between the two interaction 
partners, of preventing the complementation as such, or of preventing the translocation of 
30 the first conjugate. 

As shown in the examples, some spontaneous fluorescence appears. It is presently 
hypothesised that this is due to the equilibrium of one or both conjugates between the 
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various cellular locations Thus, when coinciding with the other interaction partner, 
interaction and consequent complementation occurs. As the complemented protein does 
not dissociate this will constitute a sink effect (as described in detail above). This effect 
can, dependent on the system in question, be rather pronounced. The problem can be 
5 that the natural shuttling of any naturally translocating protein is exposing the anchored 
interaction partner construct to a constant supply of binding partner resulting in apparently 
irreversible binding and premature fluorophore formation. 

One way of solving this is to replace the binding interaction partners with interaction 
partners requiring an interaction stimulus to interact, often referred to as a bridging 
10 molecule. Such system provides convenient means of associating e.g. the nuclear fraction 
of a translocating protein to an anchor protein at any given time. 

One example of such interaction pairs is the FKBP12 : FRB system (as described in detail 
above). Thus, one aspect of the invention relates to a method for detecting translocation 
of a protein comprising the steps of: 
15 (a) providing a cell comprising at least: 

- a first conjugate comprising said protein, a first terminal of a complementation protein, 
and interaction partner A; and 

- a second conjugate comprising an anchor protein, a second terminal of the 
complementation protein, and interaction partner B; 

20 wherein said first conjugate has a predominant cellular location that is distinct and 
spatially separated from where said second conjugate is predominantly located, 
wherein said interaction partner A and interaction partner B bind to each other only 
when an interaction stimulus has been applied; 
(b) add an interaction stimulus; 

25 (c) determine if the first and second terminal of the complementation protein has 
complemented; 

complementation between the two terminals being indicative that said protein has 
translocated from the predominant location of the protein to the cellular location where the 
anchor protein is anchored after addition of the interaction stimulus (Figure 21). 

30 In the typical use of this method for detecting translocation of a protein, before step (b) a 
stimulus know to cause translocation of the first conjugate is applied. This stimulus could 
be stress, light, a compound or other means. Also, but before adding the translocation 
stimulus, a compound to be tested is added. If this compound is capable of prevention 
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complementation, it is capable of preventing the interaction between the two interaction 
partners, of prevention the complementation as such, or of preventing the translocation of 
the first conjugate. 

As shown in Example 11, using the FRAP system, no spontaneous interaction is 
5 observed even though the two conjugates are in the cytoplasm. Thus as discussed above, 
the efforts in keeping the proteins apart need not to be more than what the natural 
balance between the location of proteins dictates. 

Sometimes measurement of translocation perse is indeed interesting. However, typically 
10 translocation is a result of underlying cellular processes. Some of these underlying 

processes can be utilized to gain information about other processes. For example, when a 
cell is undergoing apoptosis, the activity of caspase enzymes is increased. With the 
method and materials of the present innvention it is made possible to detect if the peptide 
sequence, a consensus cleavage site for caspases has been cleaved (see figure 22). The 
15 method for determining apoptosis / caspase activity in a cell comprises at least the steps 
of: 

(a) providing a cell comprising at least: 

- a first conjugate comprising the first terminal of a complementation protein, interaction 

partner A, a nuclear localisation sequence (NLS), a Valine-Alanine-Aspartate (VAD) 
20 sequence and an anchor protein; 

- a second conjugate comprising the second terminal of a complementation protein, 

interaction partner B and an anchor protein; 
wherein the anchor protein of the first conjugate is in a different cellular location from the 
nucleus, and the anchor protein of the second conjugate is in the nucleus and 
25 wherein interaction partner A, the NLS and the first terminal of the complementation 

protein are all located on the same side of the VAD sequence and the anchor protein is 
located on the other side of the NLS; 

(b) determine if the complementation protein has complemented 
complementation being indicative that caspase has cleaved the first conjugate and the 

30 free part has translocated to the nucleus, and partner A and complementation partner B 
has complemented; 



A kit for detecting antigens comprising: 
(a) a first antibody that binds to the antigen; 
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(b) a first conjugate comprising a protein binding to the first antibody and the N-terminai of 
a complementation protein; 

(c) a second conjugate comprising a protein binding to the first antibody and the C- 
terminal of the complementation protein; 

5 whereby binding of the first and second conjugate to the first antibody will bring the two 
terminals of the complementation protein so close that a functional protein is formed. 

The thoughts of how to construct and test probes useful in the present invention are 
described below using, as a specific example, GFP as the complementing protein. As will 
10 be evident to the skilled person, similar thoughts will aply using other complementing 
proteins and the complementing fragments thereof. 

In general, a probe, i.e. a "GeneX"-GFP fusion or a GFP-"GeneX" fusion, is constructed 
using PCR with "GeneX"-specific primers followed by a cloning step to fuse "GeneX" in 
frame with GFP. The fusion may contain a short vector derived sequence between 
15 "GeneX" and GFP (e.g. part of a multiple cloning site region in the plasmid) resulting in a 
peptide linker between "GeneX" and GFP in the resulting fusion protein. 

Detailed stepwise procedure: 

- Identifying the sequence of the gene. This is most readily done by searching a 
depository of genetic information, e.g. the GenBank Sequence Database, which is widely 

20 available and routinely used by molecular biologists. In the specific examples below the 
GenBank Accession number of the gene in question is provided. 

- Design of gene-specific primers. Inspection of the sequence of the gene allows design of 
gene-specific primers to be used in a PCR reaction. Typically, the top-strand primer 
encompasses the ATG start codon of the gene and the following ca. 20 nucleotides, while 

25 the bottom-strand primer encompasses the stop codon and the ca. 20 preceding 

nucleotides, if the gene is to be fused behind GFP, i.e. a GFP-"GeneX" fusion. If the gene 
is to be fused in front of GFP, i.e. a "GeneX"-GFP fusion, a stop codon must be avoided. 
Optionally, the full length sequence of GeneX may not be used in the fusion, but merely 
the part which localizes and redistributes like GeneX in response to a signal. 

30 In addition to gene-specific sequences, the primers contain at least one recognition 
sequence for a restriction enzyme, to allow subsequent cloning of the PCR product. The 
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sites are chosen so that they are unique in the PCR product and compatible with sites in 
the cloning vector. Furthermore, it may be necessary to include an exact number of 
nucleotides between the restriction enzyme site and the gene-specific sequence in order 
to establish the correct reading frame of the fusion gene and/or a translation initiation 
5 consensus sequence. Lastly, the primers always contain a few nucleotides in front of the 
restriction enzyme site to allow efficient digestion with the enzyme. 

-Identifying a source of the gene to be amplified. In order for a PCR reaction to produce a 
product with gene-specific primers, the gene-sequence must initially be present in the 
reaction, e.g. in the form of cDNA. Information in GenBank or the scientific literature will 
10 usually indicate in which tissue(s) the gene is expressed, and cDNA libraries from a great 
variety of tissues or cell types from various species are commercially available, e.g. from 
Clontech (Palo Alto), Stratagene (La Jolla) and Invitrogen (San Diego). Many genes are 
also available in cloned form from The American Type Tissue Collection (Virginia). 

- Optimizing the PCR reaction. Several factors are known to influence the efficiency and 
15 specificity of a PCR reaction, including the annealing temperature of the primers, the 

concentration of ions, notably Mg 2+ and K\ present in the reaction, as well as pH of the 
reaction. If the result of a PCR reaction is deemed unsatisfactory, it might be because the 
parameters mentioned above are not optimal. Various annealing temperatures should be 
tested, e.g. in a PCR machine with a built-in temperature gradient, available from e.g. 
20 Stratagene (La Jolla), and/or various buffer compositions should be tried, e.g. the 
OptiPrime buffer system from Stratagene (La Jolla). 

- Cloning the PCR product. The vector into which the amplified gene product will be 
cloned and fused with GFP will already have been taken into consideration when the 
primers were designed. When choosing a vector, one should at least consider in which 

25 cell types the probe subsequently will be expressed, so that the promoter controlling 

expression of the probe is compatible with the cells. Most expression vectors also contain 
one or more selective markers, e.g. conferring resistance to a drug, which is a useful 
feature when one wants to make stable transfectants. The selective marker should also 
be compatible with the cells to be used. 

30 The actual cloning of the PCR product should present no difficulty as it typically will be a 
one-step cloning of a fragment digested with two different restriction enzymes into a 
vector digested with the same two enzymes. If the cloning proves to be problematic, it 
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may be because the restriction enzymes did not work well with the PCR fragment. In this 
case one could add longer extensions to the end of the primers to overcome a possible 
difficulty of digestion close to a fragment end, or one could introduce an intermediate 
cloning step not based on restriction enzyme digestion. Several companies offer systems 
5 for this approach, e.g. Invitrogen (San Diego) and Clontech (Palo Alto). 

Once the gene has been cloned and, in the process, fused with the GFP gene, the 
resulting product, usually a plasmid, should be carefully checked to make sure it is as 
expected. The most exact test would be to obtain the nucleotide sequence of the fusion- 
gene. 

10 Testing the probe 

Once a DNA construct for a probe has been generated, its functionality and usefulness 
may be tested. In a typical scenario, the two construes are individually validated with full 
length GFP by subjecting it to the following tests: 

- Transfecting it into cells capable of expressing the probe. The fluorescence of the cell is 
15 inspected soon after, typically the next day. At this point, two features of cellular 
fluorescence are noted: the intensity and the sub-cellular localization. 

The intensity should usually be at least as strong as that of unfused GFP in the cells. If it 
is not, the sequence or quality of the probe-DNA might be faulty, and should be carefully 



20 The sub-cellular localization is an indication of whether the probe is likely to perform well. 
If it localizes as expected for the gene in question, e.g. is excluded from the nucleus, it 
can immediately go on to a functional test. If the probe is not localized soon after the 
transfection procedure, it may be because of overexpression at this point in time, as the 
cell typically will have taken of very many copies of the plasmid, and localization will occur 

25 in time, e.g. within a few weeks, as plasmid copy number and expression level decreases. 
If localization does not occur after prolonged time, it may be because the fusion to GFP 
has destroyed a localization function, e.g. masked a protein sequence essential for 
interaction with its normal cellular anchor-protein. In this case the opposite fusion might 
work, e.g. if GeneX-GFP does not work, GFP-GeneX might, as two different parts of 

30 GeneX will be affected by the proximity to GFP. If this does not work, the proximity of GFP 



checked. 
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at either end might be a problem, and it could be attempted to increase the distance by 
incorporating a longer linker between GeneX and GFP in the DNA construct. 

If there is no prior knowledge of localization, and no localization is observed, it may be 
because the probe should not be localized at this point, because such is the nature of the 
5 protein fused to GFP. It should then be subjected to a functional test. 

In a functional test, the cells expressing the probe are treated with at least one compound 
known to perturb, usually by activating, the signalling pathway on which the probe is 
expected to report by redistributing itself within the cell. If the redistribution is as 
expected, e.g. if prior knowledge tell that it should translocate from location X to location 
10 Y, it has passed the first critical test. In this case it can go on to further characterization 
and quantification of the response. 

If it does not perform as expected, it may be because the cell lacks at least one 
component of the signalling pathway, e.g. a cell surface receptor, or there is species 
incompatibility, e.g. if the probe is modelled on sequence information of a human 
15 geneproduct, and the cell is of hamster origin. In both instances one should identify other 
cell types for the testing process where these potential problems would not apply. 

It this is satisfactory, the conjugates will be constructed with fragments of the 
complementation protein instead of full-lenght and tested when expressed in the same 
cell. If no feature of redistribution can be identified, the problem might be as mentioned 
20 above, and the probe should be retested under more optimal cellular conditions. 

If the probe does not perform under optimal cellular conditions it's back to the drawing 
board. 

Numerous cell systems for transfection exist. A few examples of mammalian cells isolated 
25 directly from tissues or organs taken from healthy or diseased animals (primary cells), or 
transformed mammalian cells capable of indefinite replication under cell culture conditions 
(cell lines). The term "mammalian cell" is intended to indicate any living cell of mammalian 
origin. The cell may be an established cell line, many of which are available from The 
American Type Culture Collection (ATCC, Virginia, USA) or similar Cell Culture 
30 Collections. The cell may be a primary cell with a limited life span derived from a 
mammalian tissue, including tissues derived from a transgenic animal, or a newly 
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established immortal cell line derived from a mammalian tissue including transgenic 
tissues, or a hybrid cell or cell line derived by fusing different cell types of mammalian 
origin e.g. hybridoma cell lines. The cells may optionally express one or more non-native 
gene products, e.g. receptors, enzymes, enzyme substrates, prior to or in addition to the 
5 fluorescent probe. Preferred cell lines include, but are not limited to, those of fibroblast 
origin, e.g. BHK, CHO, BALB, NIH-3T3 or of endothelial origin, e.g. HUVEC, BAE (bovine 
artery endothelial), CPAE (cow pulmonary artery endothelial), HLMVEC (human lung 
micro vascular endothelial cells), or of airway epithelial origin, e.g. BEAS-2B, or of 
pancreatic origin, e.g. RIN, INS-1, MIN6, bTC3, aTC6, bTC6, HIT, or of hematopoietic 
10 origin, e.g. primary isolated human monocytes, macrophages, neutrophils, basophils, 
eosinophils and lymphocyte populations, AML-14, AML-193, HL-60, RBL-1, U937, RAW, 
JAWS, or of adipocyte origin, e.g. 3T3-L1 , human pre-adipocytes, or of neuroendocrine 
origin, e.g. AtT20, PC12, GH3, muscle origin, e.g. SKMC, A10, C2C12, renal origin, e.g. 
HEK 293, LLC-PK1 , or of neuronal origin, e.g. SK-N-DZ, SK-N-BE(2), HCN-1A, NT2/D1. 

15 The examples of the present invention are based on CHO cells. Therefore, fibroblast 
derived cell lines such as BALB, NIH-3T3 and BHK cells are preferred. 

It is preferred that the heterologous conjugates are introduced into the cell as plasmids, 
e.g. individual plasmids mixed upon application to cells with a suitable transfection agent 

20 such as FuGENE so that transfected cells express and integrate all heterologous 

conjugates (or GFP fragments) simultaneously. Plasmids coding for each conjugate will 
contain a different genetic resistance marker to allow selection of cells expressing those 
conjugates. It is also preferred that each of the conjugates also contains a distinct amino 
acid sequence, such as the HA or myc or Flag markers, that may be detected 

25 immunocytochemically so that the expression of these conjugates in cells may be readily 
confirmed. Many other means for introduction of one or both of the conjugates are evenly 
feasible e.g. electroporation, calcium phosphate precipitate, microinjection, adenovirus 
and retroviral methods, bicistronic plasmids encoding both conjugates etc. 

30 Throughout the present invention, the term "protein" should have the general meaning. 
That includes not only a translated protein, a peptide or a protein fragment, but also 
chemically synthesized proteins. For proteins translated within the cell, the naturally, or 
induced, post-translational modifications such as glycosylate and lipidation are expected 
to occur and those products are still considered proteins. 
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The term "intracellular protein interaction" has the general meaning of an interaction 
between two proteins, as described above, within the same cell. The interaction is due to 
covalent and/or non-covalent forces between the protein components, most usually 
5 between one or more regions or domains on each protein whose physico-chemical 
properties allow for a more or less specific recognition and subsequent interaction 
between the two protein components involved. In a preferred embodiment, the 
intracellular interaction is a protein-protein binding: 

1 0 The recording of the fluorescence will vary according to the purpose of the method in 
question. In one embodiment the emitted light is measured with various apparatus known 
to the person skilled in the art. Typically, such apparatus comprises the following 
components: (a) a light source, (b) a method for selecting the wavelength(s) of light from 
the source that will excite the luminescence of the luminophore, (c) a device that can 

15 rapidly block or pass the excitation light into the rest of the system, (d) a series of optical 
elements for conveying the excitation light to the specimen, collecting the emitted 
fluorescence in a spatially resolved fashion, and forming an image from this fluorescence 
emission (or another type of intensity map relevant to the method of detection and 
measurement), (e) a bench or stand that holds the container of the cells being measured 

20 in a predetermined geometry with respect to the series of optical elements, (f) a detector 
to record the light intensity, preferably in the form of an image, (g) a computer or 
electronic system and associated software to acquire and store the recorded information 
and/or images, and to compute the degree of redistribution from the recorded images. 

In a preferred embodiment of the invention, the apparatus system is automated. In one 
25 embodiment, the components in (d) and (e) mentioned above comprise a fluorescence 
microscope. In one embodiment, the component in (f) mentioned above is a CCD camera. 
In one embodiment, the component in (f) mentioned above is an array of photo multiplier 
tubes/devices. 

In one embodiment of the invention, the actual fluorescence measurements are made in a 
30 standard type of fluorometer for plates of micro titer type (fluorescence plate reader). 

In one embodiment, the optical scanning system is used to illuminate the bottom of a plate 
of micro titer type so that a time-resolved recording of changes in luminescence or 
fluorescence can be made from all spatial limitations simultaneously. 
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In one embodiment, the image is formed and recorded by an optical scanning system. 

A variety of instruments exist to measure light intensity. In one embodiment a 
fluorescence plate reader is used (e.g. Wallac Victor (BD Biosciences), Spectrafluor 
5 (Tecan), Flex station (Molecular Devices), Explorer (Acumen)). In another embodiment an 
imaging plate readers is used (e.g. FLIPR (Molecular Devices) LeadSeaker (Amersham), 
VIPR (Molecular Devices)). In another embodiment an automated imager is used like 
Arrayscan (Cellomics), Incell Analyser (Amersham), Opera (Evotec). In a still further 
embodiment a confocal fluorescence microscope is used (e.g. LSM510 (Zeiss)). 

10 

One particular advantage of the present method is that it can be carried out in a 
heterogeneous cell population. This avoids inter alia the steps required to get clonal cells. 
This is achieved by fluorescence activated cell sorting (FACS) prior to testing. One step in 
that process is removal of the most green cells, that is the cells wherein functional 

15 fluorescence is achieved even though the two proteins of interest were not supposed to 
interact. Another step is removal of the black cells, that is the cells wherein the two 
heterologous conjugates do not interact e.g. where no or little functional complementation 
occurs. This could be due to lack of transfection in those cells, a poor expression ratio 
between the two constructs, or lack of functional expression of either construct. It is 

20 presently anticipated that, in both the most green cells and the black ceils, the transfection 
has not taken place as desired, resulting in no, poor, or excessive complementation of the 
heterologous conjugates. The hereby obtained "medium to low-green" cells are then used 
in any of the methods described above, or other complementation based methods. The 
"most green", "medium green", "low green" and "black" cells respectively have decreasing 

25 levels of fluorescence relative to on another. These levels are predetermined by the 
skilled artisan in relative proportions 

The preferred method for detecting interactions between proteins of interest include an 
additional FACS. The aim of this second FACS step is to isolate cells with a large 
dynamic range. The first action in the additional FACS is stimulating the "medium to low- 

30 green" FACS cells with the compound that induce interaction between two proteins of 
interest and thereafter allow sufficient time to pass to let the proteins interact and the 
fluorescent protein fragments fold and become fluorescent. The next action is to subject 
them to a second FACS removing the dark cells and removing the medium green cells. 
The remaining population of cells (most green cells or very green cells (VGC)) will have a 

35 low to medium background and are still capable of forming the fluorescent protein upon 
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interaction between the two proteins of interest. When the cells have grown to sufficient 
number, and a number of generations will have diluted the fluorescence, the cells are . 
ready to use in any of the methods outlined above, e.g. detecting compounds that induce 
interaction between two proteins of interest and to screen for compounds that interfere 
5 with a conditional interaction between two protein components. 

In a preferred aspect of the methods, the at least one cell is a mammalian cell. 

The term "compound" is intended to indicate any sample, that has a biological function or 
exerts a biological effect in a cellular system. The sample may be a sample of a biological 
material such as a sample of a body fluid including blood, plasma, saliva, milk, urine, or a 
10 microbial or plant extract, an environmental sample containing pollutants including heavy 
metals or toxins, or it may be a sample containing a compound or mixture of compounds 
prepared by organic synthesis or genetic techniques. The compound may be small or- 
ganic compounds or biopolymers, including proteins and peptides. 

In another preferred aspect of the methods, the heterologous conjugates are fusion 
15 proteins. 

One aspect of the invention relates to multiplexing split fluorophore complementation 
using different colours. By combining one fluorescent protein fragment with two or more 
appropriate complementary fragments, it is possible to determine the extent of binding of 

20 the first fluorescent protein fragment to either of the two other complementary fragments if 
the two possible fluorescent complexes have distinct fluorescence excitation or emission 
characteristics or both. Typically, the first fluorescent protein fragment will be fused to a 
protein that may bind to either of three other proteins each of them being appropriately 
fused to distinct complementary fragments. For example, the first fluorescent protein 

25 fragment can be a C-terminal fragment of enhanced GFP (EGFP, SEQ ID NO: 1) 

obtained by splitting EGFP after residue 80. The three complementary fragments can be 
appropriate N-terminal fragments of EGFP, of EGFP Y66W (SEQ ID NO: 2) and EGFP 
Y66H (SEQ ID NO: 3), respectively. The three EGFP variants have different spectral 
characteristics: 
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Fluorescent protein 


Excitation max 


Emission max 




(nm) 


(nm) 


GFP 


396 


508 


GFP Y66W 


382 


448 


GFP Y66H 


458 


480 



Reference: Heim, R., Prasher, D.C., and Tsien, R.Y. (1994) Wavelength mutations and 
posttranslational autoxidation of green fluorescent protein. Proc. Natl. Acad. Sci .U. S. A. 
91, 12501-12504. 



5 For example, fluorescent complexes produced by assembling a C-terminal half of GFP 
(e.g. residues 158 to 238) with corresponding N-terminal halves (e.g. residues 1-157) of 
GFP, GFP Y66W, or GFP Y66H will have dearly distinct fluorescence characteristics and 
the relative amounts complexes in mixtures can be calculated. 

Not only various colours can be used as exemplified above, but other physical parameters 
10 of the fluorophore can be altered e.g. intensity, fluorescent life-time, folding time etc. 

The two colour multiplexing has several uses: In most cases, as described above, the 
protein of interest is linked to the "constant" half of the fluorophore whereas its interaction 
partners each are linked to "variable" parts of the fluorophore e.g. one that upon fusion 

15 gives rise to a green fluorophore and one that gives rise to a blue fluorophore. This can 
give spatial information, ie the two different interaction partners are in different locations 
so colour will tell you where your protein of interest is. The interaction partners could also 
be in the same location so here colour gives you an indication of which interaction partner 
your protein prefers at any given time. Finally, as a special case of the latter, if your 

20 protein's interaction with the different partners is modulated by eg posttranslational 
modifications, colour can tell you whether your protein is modified or not. These three 
different setups can be used either as sensors for the physical state of your protein in the 
broadest possible sense, or they can be used as screening assays where you measure 
the ability of the test compounds to alter the ratio between the two colour readouts. 

25 Finally, colour is only one physical parameter of GFPs. Other physical parameters that 
can be localised to specific amino acids in the GFP sequence and that are easily 
detectable, such as absorption spectra, fluorescence lifetime, time for fluorophore 
maturation etc., could be employed in exactly the same way as colour. The number of 
different interaction partners need not be limited to two. 
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One special example shows how this method may be employed as a tool to use spatial 
information for screening, even if nothing is known about the interaction partners of a 
particular protein in different cellular locations (or if it has no interaction partners in one or 
more of those compartments). Cystic Fibrosis is perhaps the most frequent and well- 
5 studied protein trafficking disease. The cystic fibrosis transmembrane conductance 
regulator (CFTR) is a multi-membrane spanning protein that normally functions at the 
apical plasma membrane of airway epithelial cells as a Cl-efflux channel. The most 
common mutation, (DeltaF508) causes the protein to be retained in the endoplasmic 
reticulum (ER) and so reduces the amount of CFTR expressed in the plasma membrane 

1 0 of epithelial cells, resulting in decreased CI- efflux from the cells. It would appear from 
numerous studies that this ER retention defect of DeltaF508 is reversible, and reduced 
temperature, some small molecules, and induction of "chaperones" allow DeltaF508 to 
traffic to the plasma membrane and increase CI- permeability. One way of screening for 
compounds that modify mutant CFTR behaviour employs the split fluorophore /multiple 

15 colours concept. You could express the mutant CFTR in cells as a fusion with a zipper 
fragment fused to the "constant" half of the fluorophore. Expression in the same cell 
fusions of ER and plasma membrane markers with zippers fused to different colours will 
reveal when the CFTR mutant reaches the ER, this gives rise to one colour. If CFTR is 
moved to the PM e.g. by drug stimulus, that would give rise to a different colour. So by 

20 screening for compounds that favours the generation of the "PM colour", you could find 
molecules that specifically correct the ER retention defect of CF patients. 

One aspect of the invention thus relates to a method for generating a library of interacting 
proteins within living cells consisting of: 

25 1 . Introducing into a pool of cells two sets of plasmids, either simultaneously or 
sequentially, one set of plasmids encoding a library of proteins A each fused to the N- 
terminal half of the complementing fluorophore and the second set of plasmids encoding a 
library of proteins B each fused to the C-terminal half of the complementing fluorophore. 

2. Sorting the cells by imaging into those where a functional fluorophore has been 
30 formed in the right location and those where a functional fluorophore has not been formed 
or those where a functional fluorophore has been formed but in the wrong cellular 
compartment, the formation of said functional fluorophore being indicative of an 
interaction having occurred between proteins A and B within the right cellular 
compartment. 



WO 03/089627 




CT/DK03/00266 



32 



For example, for a protein moving from one cellular location to another cellular location in 
response to a stimuli, the binding partners in each location can be identified. 

In one embodiment of the invention, the method relates to identification of drugs that will 
5 cause disruption of binding between two proteins when located in one cellular 
compartment or location but not in another cellular compartment or location. This 
embodiment is carried out essentially as described above with the only difference that 
instead of sorting the cells based on intensity, the cells are imaged with standard imaging 
equipment to determine not only if binding has taken place, but also where such binding 
10 has occurred. 

This system is also useful for screening for fluorophores with novel properties such as 
those that can be used in the split fluorophore complemensation usages described above. 
By mutagenisis of both the N- and C-termini of GFP in this system, the system is used to 

15 screen in a combinatorial manner for double (or more) mutants of the fluorophore with 
novel properties. This gives a wider selection to choose from. Furthermore, as the two 
mutations are in different halves of the molecule, they could be additive or compensatory, 
or both. Finally, the fact that they have been found using the split fluorophore 
complementation system immediately means that they can be used as sensors in this 

20 system. 



The invention will be illustrated more specifically in the following non-limiting examples. 
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Examples 



Example 1: Alignment of fluorescent proteins 



GenBank entry 


Fluorescent protein 


P42212 


Aequorea victoria green-fluorescent protein 


AF372525 


Renilla reniformis green fluorescent protein 


AY015996 


Renilla muelleri green fluorescent protein 


AY013824 


Aequorea macrodactyla isolate GFPxm 


AF384683 


Montastraea cavernosa green fluorescent protein 


AF401282 


Montastraea faveolata green fluorescent protein 


AY015995 


Ptilosarcus sp. CSG-2001 green fluorescent protein 


AF322221 


Anemonia sulcata green fluorescent protein asFP499 


AF322222 


Anemonia sulcata nonfluorescent red protein asCP562 


AF246709 


Anemonia sulcata GFP-like chromoprotein FP595 


AF168419 


DsRed Discosoma sp. fluorescent protein FP583 


AF1 68420 


Discosoma striata fluorescent protein FP483 


AF1 68421 


Anemonia majano fluorescent protein FP486 


AF1 68422 


Zoanthus sp. fluorescent protein FP506 


AF1 68423 


Zoanthus sp. fluorescent protein FP538 


AF1 68424 


Clavularia sp. fluorescent protein FP484 



The alignment is presented in Figure 16. 



5 Example 2: Construction ofEGFP complementation fragment probes 

Anti-parallel leucine zippers (called NZ and CZ) that can bind to each other within 
prokaryotic and eukaryotic were fused to different fragments of GFP to evaluate the 
optimal site for splitting GFP for use of such fragments in molecular complementation 
experiments, including fluorescence reassembly experiments. NZ and CZ leucine zippers 

10 were prepared by annealing and ligating phosphorylated oligo nucleotides 21 10-21 15 (for 
NZ zipper, see Table 2) or phosphorylated oligo nucleotides 21 16-2121 (for CZ zipper), 
into Ncol-BamHI cut pTrcHis-A vector (commercially available from Invitrogen) producing 
vector PS1515 (expression vector encoding NZ zipper) or PS1516 (expression vector 
encoding CZ zipper). The oligos ligated in NZ and CZ annealing mixes 1 produced the 

1 5 coding sequences of the N-terminal parts of the NZ and CZ zippers. The oligos ligated in 
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NZ and CZ annealing mixes 2 produced the coding sequences of the middle parts of the 
NZ and CZ zippers and the oligos ligated in NZ and CZ annealing mixes 3 produced the 
coding sequences of the C-terminal parts of the NZ and CZ zippers. 

Annealing primer pairs for NZ zipper 



5 NZ annealing mix 1 

Forward oligo 21 1 0 (1 p.M) 5 \i\ 

Reverse oligo 21 1 1 (1 5 \x\ 

50 mM Tris-HCI, 1 0 mM MgCI 2 , pH 8.0 2 jal 

H 2 0 8[i\ 

NZ annealing mix 2 

Forward oligo 21 12 (1 jiM) 5 pJ 

Reverse oligo 21 1 3 (1 jiM) 5 nl 

50 mM Tris-HCI, 10 mM MgCI 2 , pH 8.0 2 \i\ 

H 2 Q 8 \x\ 



NZ annealing mix 3 

Forward oligo 2114 (1 \xhA) 5 \i\ 

Reverse oligo 21 1 5 (1 \xM) 5 \i\ 

50 mM Tris-HCI, 10 mM MgCI 2 , pH 8.0 2 

H 2 0 8 nl 



Each of the annealing mixes were heated at 80°C for 2 minutes on a pre-heated Hybaid 
OmniGene PCR machine which was subsequently turned off and allowed to cool to room 
temperature (about 10 min). The fragments were subsequently put on ice. 
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Annealing primer pairs for CZ zipper 
CZ annealing mix 1 

Forward oligo 21 16 (1 pM) 5 pi 

Reverse oligo 21 17 (1 pM) 5 pi 

50 mM Tris-HCI, 10 mM MgCI 2> pH 8.0 2 pi 
H 2 0 8 pi 



CZ annealing mix 2 

Forward oligo 21 18 (1 pM) 5 pi 

Reverse oligo 21 1 9 (1 pM) 5 pi 

50 mM Tris-HCI, 10 mM MgCI 2l pH 8.0 2 pi 

H 2 Q 8 pi 



CZ annealing mix 3 

Forward oligo 2120 (1 pM) 5 pi 

Reverse oligo 2121 (1 pM) 5 pi 

50 mM Tris-HCI, 10 mM MgCI 2 , pH 8.0 2 pi 

H 2 0 8 pi 



Each of the annealing mixes were heated at 80°C for 2 minutes on a pre-heated Hybaid 
OmniGene PCR machine which was subsequently turned off and allowed to cool to room 
10 temperature (about 10 min). The fragments were subsequently put on ice. 



Restriction digestion ofpTrcHis-A prokaryotic expression vector 

The pTrcHis-A prokaryotic expression vector, cut with Ncol and BamHI restriction 
enzymes and gel purified, was used for cloning the prepared NZ and CZ leucine zipper 
coding sequences: 
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Restriction digestion of pTrcHis-A vector 

pTrcHis-A (1 jag/jal) 2 pi 

Ncoi(10U/pl) 1 p| 

Nhel (5 U/jal), optional 0.5 fx! 

BamHI (20 U/pl) 1 pi 

100xBSA 0.4 pi 

10x NEB (New England Biolabs, NEB) BamHI buffer 3 pi 

H 2 0 23 p! 

Calf intestinal phosphatase (optional, last 20 min only) 0.5 jil 



The vector was digested for about 1 hour at 37°C and purified by agarose gel 
electrophoresis. The desired vector fragment was recovered from the gel using the 
5 QIAquick Gel Extraction kit (spin columns) from Qiagen and recovered in 50 jxl of elution 
buffer. Nhel, which cuts between Ncol and BamHI, was included to minimise the amounts 
of uncut and self-Iigating vector. 

Ligation and transformation of annealed NZ oligo pairs 

Each of the three NZ annealing mixtures 1-3 was diluted 50-fold (1 jil of mixture in 50 \i\ of 
10 H 2 0) and mixed and ligated into the cut vector as follows (three hours at 20-24°C): 

Ligation of NZ zipper fragments into pTrcHis-A vector . 



Annealing mix 1 1 pi 

Annealing mix 2 1 p| 

Annealing mix 3 1 p| 

10x T4 DNA ligase buffer (New England Biolabs) 1 pi 

T4 DNA ligase (400 U/^il, New England Biolabs) 0.5 pi 

pTrcHis-A (Ncol + BamHI cut) 0.5 pi 

H 2 0 5 pi 



Alternatively, the fragments in NZ annealing mixes 1, 2, and 3 can be ligated in absence 
of vector and purified by agarose gel electrophoresis before being ligated into the Ncol- 
15 BamHI cut vector. The annealed and ligated oligo nucleotides from annealing mixes 1-3 
had single stranded terminal overhangs that were compatible with the overhangs that 
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were generated by Ncol and BamHI restriction digestion of pTrcHis-A. After ligation of the 
fragment into cut pTrcHis-A, the Ncol and BamHI sites were regenerated. 



Following ligation into the vector, 2 p.1 of the ligation mixture was transformed into 50 jol of 
One Shot TOP10 chemically competent E. coli cells (Invitrogen) following the 
5 manufacturers protocol. The ligation can be performed using different amounts or 
volumes fragments and buffers. The inserted DNA sequence (SEQ ID NO: 7) and the 
encoded NZ zipper peptide (SEQ ID NO: 8) are as follows: 

MAGGTGSGAL KKE LQANKKE 
CCATGG CCGGTGGT ACCGGT TCCGGTGCCCTGAAGAAGGAGCTGCAGGCCAACAAGAAGGAG 

10 

LAQL KWELQALKKELAQ * D 
CTGGC C CAGCTGAAGTGGGAGCTGCAGGCCCTGAAGAAGGAGCTGGC C CAGT AGGATC C 

The Gly-Gly-Thr-Gly-Ser-Gly amino acid sequence in the terminus is part of the linker 
1 5 sequence that was inserted between the NZ zipper peptide and the N-terminal fragments 
of EGFP (NtermEGFP). The zipper sequence in the NtermEGFP-NZ fusion protein is also 
Gly-Gly-Thr-Gly-Ser-Gly with the Gly-Gly-Thr-Gly coding sequence being repeated in the 
NtermEGFP reverse amplification primers 2129, 2130, and 2131 (Table 3). Underlined 
are the unique Ncol (ccatgg), Agel (accggt) and BamHI (ggatcc) sites used for 
20 cloning of the zipper peptide into pTrcHis-A and the NtermEGFP-NZ fragments into the 
NZ zipper vector PS1515 (see below). The asterisk (*) shows a stop codon. 

Ligation and transformation of annealed CZ oligo pairs 

Each of the three CZ annealing mixtures 4-6 was diluted 50-fold (1 jlxI of mixture in 50 jil of 
H 2 0) and mixed as follows: 
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Ligation of CZ zipper fragments into pTrcHis-A vector 

CZ annealing mix 1 1 • 

CZ annealing mix 2 1 \x\ 

CZ annealing mix 3 1 jxl 

10x T4 DNA ligase buffer (New England Biolabs) 1 jnl 

T4 DNA ligase (400 U/jllI, New England Biolabs) 0.5 \i\ 

pTrcHis-A (Ncol + BamHI cut) 0.5 |il 

H 2 0 5 ]i\ 



Alternatively, the fragments in CZ annealing mixes 1, 2, and 3 can be ligated in absence 
of vector and purified by agarose gel electrophoresis before being ligated into the Ncol- 
5 BamHI cut vector. The annealed and ligated oligo nucleotides from annealing mixes 1-3 
had single stranded terminal overhangs that were compatible with the overhangs that 
were generated by Ncol and BamHI restriction digestion of pTrcHis-A. After ligation of the 
fragment into cut pTrcHis-A, the Ncol and BamHI sites were regenerated. 

Following ligation into the vector, 2 |xl of the ligation mixture were transformation into 50 \x\ 
10 of One Shot TOP1 0 chemically competent E. coli cells (Invitrogen) following the 
manufacturers protocol. The ligation can be performed using different amounts or 
volumes fragments and buffers. The inserted DNA sequence (SEQ ID NO: 9) and the 
encoded CZ zipper peptide (SEQ ID NO: 10) are as follows: 

MAS E QLE KKLQALE.KKLAQL 
1 5 CCATGGCCAGCGAGCAGC TGGAGAAGAAGCTGCAGGCC C TGGAGAAGAAGCTGGCC CAGCTG 

EWKNQALE KKLAQGGT G * 
GAGTGGAAGAAC CAGGC C CTGGAGAAGAAGCTGGCCC AGGGCGGCACCGGTTAGGATC C 



20 The Gly-Gly-Thr-Gly amino acid sequence in the terminus is part of the linker sequence 
that was inserted between the CZ zipper peptide and the C-terminal fragments of EGFP 
(CtermEGFP). The zipper sequence in the CZ-CtermEGFP fusion protein is also Gly-Gly- 
Thr-Gly with the Thr-Gly coding sequence being repeated in the CtermEGFP forward 
amplification primers 2133, 2134, and 2135 (Table 3). Underlined are the unique Ncol 

25 (ccatgg), Agel (accggt) and BamHI (ggatcc) sites used for cloning of the zipper 
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peptide into pTrcHis-A and the CZ-CtermEGFP fragments into the CZ zipper vector 
PS1516 (see below). The asterisk (*) shows a stop codon. 



Example 3: E. coli colony PCR screen, plasmid miniprep and DNA 
sequencing 

5 The transformed bacteria were plated on Luria Broth (LB) agar plates containing 100 
|ag/ml of carbenicillin as selection (present in used E. coli media). To quickly identify 
transformants containing the desired NZ or CZ constructs, colony PCR screening was 
performed using oligos 21 10 (5' forward NZ oligo) and 21 15 (3' reverse NZ oligo) or using 
oligos 21 16 (5' forward CZ oligo) and 2121 (3' reverse CZ oligo): 

10 Per sample (15 ml reaction volume) 

10x Taq polymerase buffer (Perkin Elmer) 1 .5 jliJ 



dNTP (5 mM nucleotide, each) 0.3 jil 

50 mM MgCI 2 0.6 |xl 

Dimethyl sulphoxide (DMSO) 0.3 \i\ 

Taq polymerase (Perkin Elmer) 0.2 |xl 

5' forward primer (10 j^M) 0.5 \i\ 

3' reverse primer (10 \xM) 0.5 |xl 

H 2 0 6.1 

Transformant resuspended in H 2 0 5.0 



Cycling parameters (RoboCvcler Gradient 96. Strataqene) 
Initial denaturation at 94°C for 3 min followed by 25 cycles of (all steps of 1 min): 
Denaturation at 94°C, primer annealing at 53°C and primer extension at 72°C. 
15 Finally, an additional extension step at 72°C was included (5 min). 

16 NZ transformants and 16 CZ transformants were screened. PCR fragments having the 
expected product sizes of about 120 base pairs were amplified from 14 NZ clones and 15 
CZ clones, as determined by agarose gel electrophoresis analysis. 

Three of the positive colonies were picked from each transformation (NZ and CZ) and 
20 used to inoculate 5 ml of liquid LB medium. After culturing at 37°C over night, plasmid 
DNA was purified by mini preparations using the QIAprep kit from Qiagen. 



WO 03/089627 




CT/DK03/00266 



40 

Plasmids containing correct NZ (PS1515) or CZ (PS1516) fragment inserts were identified 
by DNA sequencing on an ABI PRISM model 377 DNA sequencer using forward 
sequencing primer 1282. 



Example 4: Prokaryotic expression vectors encoding fusion proteins of 
5 EGFP fragment and zipper 

The DNA sequences encoding the NZ and CZ zippers in the prokaryotic expression 
vectors PS1515 and PS1516, respectively, can be fused to DNA sequences encoding 
desired EGFP fragments (N-terminal fragments of EGFP are called NtermEGFP and C- 
terminal fragments of EGFP are called CtermEGFP) or other fragments using the unique 
10 Agel restriction sites appropriately located in linker sequences in either the 5' end (as in 
the NZ vector PS1515) or in the 3' end (as in the CZ vector PS1516) of the leucine zipper 
coding sequence in combination with either of the unique Ncol or BamHI sites used for 
cloning the zipper coding fragments (DNA and amino acid sequences are shown above). 
The general structures of the fusion protein coding sequences are shown in Figure 1. 



15 For example, to prepare a prokaryotic expression vector encoding a fusion protein 
consisting of NZ zipper N-terminally fused to an NtermEGFP fragment (that is, fused to 
the C terminal of the NtermEGFP fragment), e.g. residues 1-172 (NtermEGFP172), this 
region of the EGFP coding sequence in the commercial expression vector pEGFP-C1 
(Clontech) was amplified by PGR using forward, oligo 2128 (containing a unique Ncol site) 

20 and reverse oligo 2131 (containing a unique Agel site) in accordance with Table 3. 



Per sample (50 ul reaction volume) 




10x Pfu polymerase buffer (Stratagene) 


5.0 ^il 


dNTP (5 mM nucleotide, each) 


1.0 \i\ 


Pfu Hot Start polymerase (Stratagene) 


1.0 |il 


5' forward primer (10 jiM) 


1.0 ul 


3* reverse primer (10 ^iM) 


1.0 ul 


pEGFP-C1 vector (10 ng/^l) 


2.0 ul 


H 2 Q 


39.0 ^l 



Cycling parameters (Hvbaid OmniGene PCR machine) 



Initial denaturation at 94°C for 3 min followed by 25 cycles of (all steps of 1 min): 
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Denaturation at 94°C, primer annealing at 53°C and primer extension at 72°C. 
Finally, an additional extension step at 72°C was included (5 min). 



The PCR fragment encoding the desired EGFP fragment, e.g. the above mentioned 
5 fragment composed of residues 1-172, with appropriately engineered terminal restriction 
sites contained in the primer sequences was then gel purified as described above cut with 
Ncol and Agel or Agel and BamHI and ligated into the constructed NZ or CZ prokaryotic 
leucine zipper expression vectors PS1515 or PS1516 cut with the same enzymes and gel 
purified: 

10 Restriction digestion of NtermEGFP and CtermEGFP PCR fragments 

EGFP fragment (gel purified) 26 

Ncol (10 U/|il) or BamHI (20. U/^l) 0.5 jil 

Agel (10 U/jllI) 1.0^1 

10x New England Biolabs buffer 2 3 



Restriction digestion of NZ (PS1515) and CZ (PS1516) vectors 

Vector (1 ^ig/jLiI) 1 .0 [x\ 

Ncol (10 Ufpi) or BamHI (20 U/jil) 0.33 \i\ 

Agel (10 U/nl) 0.66 jlxI 

10x New England Biolabs buffer 2 1 

H 2 Q 7 



15 



All enzymes were from New England Biolabs. The DNA preparations were digested for 1 
hour at 37°C and gel purified. 
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Ligation of EGFP fragments into cut PS1515 or PS1516 vector 



Cut and purified vector 

Cut and purified NtermEGFP or CtermEGFP 
fragment 

10x T4 DNA ligase buffer (New England Biolabs) 
T4 DNA ligase (400 U/|il, New England Biolabs) 
H 2 0 



2 \i\ 
4\x\ 



1 |il 
0.5 nl 
2.5 \x\ 



Ligation proceeded for 30 min at 22°C after which 2 fxl of each ligation mixture were 
transformed into 50 \x\ of One Shot TOP10 chemically competent E. coli cells (Invitrogen). 
5 The transformed cells were plated on LB plates containing carbenicillin and plasmids were 
prepared from two colonies from each transformation as described above. 

Example 5: EGFP based reassembly in £. coli 

Plasmids that expressed functional NtermEGFP-NZ or CZ-CtermEGFP complementation 
constructs were identified by co-transforming 10 jJ of One Shot TOP10 chemically 
10 competent E. coli cells (Invitrogen) with 1 \x\ of each of appropriately matched 
NtermEGFP-NZ or CZ-CtermEGFP plasmids (i.e., plasmids that express EGFP 
fragments, said fragments are truncated after (NtermEGFP fragments) or before 
(CtermEGFP fragments) the same splitting site and plating the co-transformed cells on LB 
plates containing carbenicillin and 5 mM of isopropyl-fi-thiogalactoside (IPTG). 

15 The transformed cells were grown over night at 37°C. E. coli colonies that were green 
fluorescent because of EGFP based reassembly were visible on the agar plate without 
magnification about 10-20 hours after transfection (the fluorescence developed further 
during storage of the plates at 5°C for one or more days) when illuminated with a blue 
light source (Fiberoptic-Heim LQ2600) and viewed through yellow filter glasses. 

20 Functional complementation was clearly visible in cells co-transformed with 

complementation constructs based on splits between either residues 157 and 158 or 
between residues 172 and 173 and the DNA sequences of expression vectors that 
produced functional NtermEGFP-NZ or CZ-CtermEGFP complementation fragments 
(named PS1594, PS1595, PS1596, PS1597, see Table 4) were verified by DNA 

25 sequencing using primer 1282 as previously described. 
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Surprisingly, the E. coli colonies of cells co-transformed with the vectors expressing the 
EGFP complementation fragments with split in the Ile171-Ser175 loop (namely between 
residues 172 and 173, vectors PS1595 and PS1597) were significantly more fluorescent 
than the colonies of cells that were co-transformed with vectors expressing EGFP 
5 complementation fragments that were split in the Ala154-Gly160 loop (namely between 
residues 157 and 158, vectors PS1594 and PS1596). 

Functional complementation was not clearly visible in cells co-transformed with 
complementation constructs based on a split between residues 144 and 145. DNA 
sequencing confirmed that expression vectors PS1614 and PS1615 encoded the correct 
10 NtermEGFP-NZ and CZ-CtermEGFP reassembly fragments, respectively. 

Example 6: Eukaryotic expression vectors encoding fusion proteins of EGFP 
fragment and zipper 

Because of the low fluorescence signal produced by the complementation fragments 
based on the 144/145 split fragments, only the complementation fragments that were 
15 based on splits at residues 157/158 or 172/173 were transferred to an eukaryotic 

expression system to permit evaluation of fragment complementation in mammalian cells. 

NtermEGFP-NZ fragments in PS1596 and PS1597, and CZ-CtermEGFP fragments in 
PS1594 and PS1595, are flanked by an Ncol site 5' to the start codons and a BamHI site 
3' to the stop codons. The fragments were transferred as blunt-ended Ncol/BamHI . 
20 fragments into mammalian expression vectors cut with Eco47III/BamHI. To select for 
stable expression of both an NtermEGFP-NZ and a CZ-CtermEGFP expressing plasmid, 
the expression vectors for NtermEGFP-NZ fragments and CZ-CtermEGFP fragments 
contain selection markers for neomycin/geneticin/G418 and zeocin, respectively. 



25 



Plasmids PS1594, PS1595, PS1596, and PS1597 were cut with Ncol restriction enzyme, 
blunt-ended with Klenow fragment, gel purified, cut with BamHI and gel purified as 
described below. 
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Restriction digestion of NtermEGFP-NZ and CZ-CtermEGFP prokaryotic expression 
vectors 

PS1 594, PS1 595, PS1 596, or PS1 597 (1 ng/nl) 1 pi 
Ncol (10 U/|il, from New England Biolabs) 1 \x\ 

10x buffer 4 (NEB) 3 jliI 

H 2 0 25 \x\ 



The plasmids were digested for about 1 hour at 37°C. 1 jal of 1 mM dNTP mix and 1 unit 
5 of Klenow fragment (New England Biolabs) were added and the reactions were incubated 
30 minutes at room temperature. The linear plasmid fragments were purified by agarose 
gel electrophoresis and recovered from the gel using the QIAquick Gel Extraction kit (spin 
columns) from Qiagen and recovered in 50 pi of elution buffer. 5 jjJ BamHI buffer (New 
England Biolabs) and 10 units BamHI enzyme were added. The plasmids were digested 
10 for about 1 hour at 37°C. The desired plasmid fragments were purified by agarose gel 
electrophoresis and recovered from the gel using the QIAquick Gel Extraction kit (spin 
columns) from Qiagen and recovered in 50 jnl of elution buffer. 

To stably co-express NtermEGFP-NZ and CZ-CtermEGFP fragments in the same 
mammalian cell, mammalian expression vectors carrying different selection markers were 
15 required. To obtain this, the kanamycin/neomycin selection marker on the expression 
vector pEGFP-C1 was replaced with a zeocin resistance marker resulting in the plasmid 
referred to as PS0609. 

Replacement of kanamycin/neomycin marker on pEGFP-C1 with zeocin marker. 

pEGFP-C1 was digested with Avrll, which excises the kanamycin/neomycin selection 
20 marker, and following gel purification, the vector fragment was ligated with an 

approximately 0.5 kbp Avrll fragment encoding zeocin resistance. This fragment was 
isolated by PCR amplification of the zeocin selection marker on plasmid pZeoSV 
(Invitrogen) using primers 9655 and 9658 (see Table 2). Both primers contain Avrll 
cloning sites and flank the zeocin resistance gene on plasmid pZeoSV including its E. coli 
25 promoter. The top primer 9658 spans the Asel site at the beginning of zeocin, which can 
be used to determine the orientation of the Avrll insert relative to the SV40 promoter 
which drives resistance in mammalian cells. The resulting plasmid is referred to as 
PS0609. 
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Plasmids pEGFP-C1 (Clontech) and its zeocin-resistant derivative PS0609 were cut with 
Eco47lll restriction enzyme, gel purified, cut with BamHI and gel purified as described 
below. These steps excise EGFP and leave the rest of the vectors intact. 

Restriction digestion of eukarvotic expression vectors 

pEGFP-CI or PS0609 DNA (1 ^ig/pj) 0.5 \i\ 

Eco47lll (10 U/|al, from Promega) 1 \x\ 

10x buffer D (Promega) 3 

H 2 0 25.5 jul 

5 



The plasmids were digested for about 1 hour at 37°C. The linear plasmid fragments were 
purified by agarose gel electrophoresis and recovered from the gel using the QIAquick 
, Gel Extraction kit (spin columns) from Qiagen and recovered in 50 jllI of elution buffer. 5 jjJ 
BamHI buffer (New England Biolabs) and 10 units BamHI enzyme were added. The 
10 plasmids were digested for about 1 hour at 37°C. The desired vector fragments were 
purified by agarose gel electrophoresis and recovered from the gel using the QIAquick 
Gel Extraction kit (spin columns) from Qiagen and recovered in 50 y\ of elution buffer. 

Ligation of NtermEGFP-NZ fragments into pEGFP-CI and CZ-CtermEGFP fragments into 
PS0609 

Cut and purified vector fragment 1 

Cut and purified NtermEGFP-NZ or CZ-CtermEGFP 3 \i\ 
fragment 

10x T4 DNA ligase buffer (New England Biolabs) 1 

T4 DNA ligase (400 U/|il, New England Biolabs) 0.5 \i\ 

H2Q 5 



Ligation reactions were incubated at 16°C overnight. 3 \x\ were transformed into One Shot 
TOP10 chemically competent E. coli cells (Invitrogen) and transformants were selected on 
imMedia with kanamycin or imMedia with zeocin (both from Invitrogen) for pEGFP-C1 and 
PS0609 derivatives, respectively. 

20 4 transformants from each transformation plate were picked in imMedia medium with 
appropriate selection (kanamycin or zeocin) and grown at 37 degrees C for 6 hours. 
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Plasmid DNA was isolated by the QIAprep spin column method (Qiagen) and analysed by 
restriction digests with Asel and Mlul. The DNA sequences of the inserts were finally 
verified by sequencing as described above. The resulting plasmids were named PS1557, 
PS1558, PS1559, and PS1560 (Table 4). 



5 Example 7: EGFP based reassembly in mammalian cells 

To establish cells lines that express EGFP fragment/zipper fusion proteins, CHO-hIR cells 
were transfected with plasmid pairs resulting in two cell lines 1) CHO-hIR 
PS1559+PS1557, and 2) CHO-hIR PS1560+PS1558. The CHO-hIR cell line consists of 
CHO-K1 (ATCC CCL-61) cells that have been stably transfected with the human insulin 

10 receptor ((MR, GenBank Acc# M1 0051) as described in: Hansen, B. F., Danielsen, G. M., 
Drejer, K., S0rensen, A. R., Wiberg, F. C, Klein, H. H., Lundemose, A. G. (1996) 
Sustained signalling from the insulin receptor after stimulation with insulin analogues 
exhibiting increased mitogenic potency. Biochem. J. Apr 1; 315 ( Pt 1):271-279)<. The 
selection marker for the vector is methotrexate (MTX). The hIR expression is very stable 

15 in the CHO-hIR cells, without selection pressure, because of the insulin-sensitivity of the 
cell line and a very stable phenotype can be maintained without selection pressure. 

Stable cells were obtained by cell growth in selection medium containing Geneticin and 
Zeocin. 

CHO-hIR cells were transfected using Fugene (Roche) according to the manufacturer's 
20 instructions. The day after transfection, cells were examined for transient expression, split 
1:10 and exposed to selection medium (growth medium supplemented with 500 pg/ml 
geneticin (Invitrogen) and 1 mg/ml zeocin (Cayla). The cells lines were stable after 2-3 
weeks of culture in selection medium. 



The growth medium used was NUT.MIX F-12 (Ham's) with GLUTAMAX-1 
25 (Gibco/lnvitrogen) supplemented with 10% fetal bovine serum (JRH Biosciences) and 1% 
Penicillin-Streptomycin (10,000 IU/ml f Gibco/lnvitrogen). The CHO-hIR cells were cultured 
in growth medium, and split 1:4 to 1:16 twice a week according to standard cell culture 
protocols. The CHO-hIR PS1559+PS1557 and CHO-hIR PS1560+PS1558 were treated 
likewise, except that the growth medium was supplemented with 500 [ig/m\ geneticin 
30 (Invitrogen) and 1 mg/ml zeocin (Cayla) at all times. 
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Images of three CHO-hIR cell lines separately transfected with pEGFP-C1 (expressing 
EGFP with a short C-terminal extension), PS1559 + PS1557 (expressing EGFP 
complementation fragments split at 157-158, NtermEGFP157-NZ + CZ-CtermEGFP158) 
and with PS1560 + PS1558 (expressing EGFP complementation fragments split at 172- 
5 173, NtermEGFPI 72-NZ + CZ-CtermEGFP173) were collected 1 day, 2 days and 10 
days after transfection to assess the relative brightness of cells expressing the different 
complementation constructs. Images were collected on a Nikon Diaphot 300 equipped for 
epifluorescence work. Light source for epifluorescence was a Nikon 100W Hg arc lamp, 
coupled to the microscope through a custom quartz fibre illuminator (TILL Photonics 

10 GmbH, Planegg, Germany). Excitation light passed through a 450-490 nm bandpass filter 
(Delta Light and Optics, Lyngby, Denmark) and was directed to the specimen via a 
Chroma 72100 505 nm cut-on dichroic mirror (Chroma Technology, Brattleboro, VT, 
USA). A x40 NA1.3 oil immersion lens was used for all images. Emitted light passed 
through a 540-550 bandpass filter (Chroma) to a Hammamatsu Orca ER camera. All 

15 images were collected with 50 millisecond exposure time, chosen to ensure non- 
saturation of images for even the brightest (EGFP-expressing) cells in each optical field 
(maximum pixel count <4095). Imaging software used to acquire images on this system 
was IPLab for Windows (Scanalytics, USA). 

Presentation and analysis of images 

20 The microscope images were analysed using the I mage J software package, the public 
domain image analysis software written by Wayne Rasband of the US National Institute of 
Health (http://rsb.info.nih.gov/ij/) and the data analysis was performed in Microsoft Excel. 
The images shown in Figure 2 are of fluorescent CHO-hIR cells co-transfected with 
different NtermEGFP-NZ and CZ-CtermEGFP expression vectors or transfected with 

25 pEGFP-C1 . The images are scaled individually to visualise the cells and the fluorescence 
distribution within them. Because of this scaling, the relative fluorescence levels cannot be 
compared between the images. When the same images are scaled identically they appear 
as in Figure 3 and it is apparent that the cells that are transfected with complementation 
constructs that are based on a split between residues 172 and 173 are significantly more 

30 fluorescent than the cells that are transfected with complementation constructs that are 
based on a split between residues 157 and 158. However, the cells transfected with the 
pEGFP-C1 construct show significantly stronger fluorescence on day 2. 



WO 03/089627 ^^CT/DK03/00266 

48 

The same images were analysed for background and maximum fluorescence intensities 
using the ImageJ software package (Figure 4). From the figure, it is clear that a split 
between residues 172 and 173, and probably anywhere else in this loop, is greatly 
superior to a split between residues 157 and 158 and probably also to splits anywhere 
5 else in this loop. 



Example 8: Eukaryotic expression vectors encoding EYFP and EYFP variant 
F64L fragment/zipper fusion proteins 

Mutagenesis of the eukaryotic NtermEGFP-NZ expression vectors PS1559 
(NtermEGFPI 57-NZ) and PS1560 (NtermEGFPI 72-NZ) into the corresponding N- 

10 terminal EYFP (SEQ ID NO: 5) fragment (NtermEYFP-NZ) variants and mutagenesis of 
. the eukaryotic CtermEGFP expression vectors PS1557 (CZ-CtermEGFP158) and 
PS1558 (CZ-CtermEGFP173) into the corresponding C-terminal EYFP fragment (CZ- 
CtermEYFP) variants was accomplished by site directed mutagenesis using the 
QuickChange kit and by following the manufacturers instructions (Stratagene). Primers 

15 2333 and 2334 were used to convert expression vectors PS1559 (NtermEGFPI 57-NZ) 
and PS1560 (NtermEGFPI 72-NZ) into N-terminal EYFP fragment expression vectors 
PS1639 (NtermEYFPI 57-NZ) and PS1642 (NtermEYFPI 72-NZ). The introduced 
mutations were: L64F:T65G:V68L:S72A. Furthermore, primers 2335 and 2336 were used 
to convert expression vectors PS1559 (NtermEGFPI 57-NZ) and PS1560 

20 (NtermEGFPI 72-NZ) into F64L mutated N-terminal EYFP fragment expression vectors 
PS1640 (NtermE[F64L]YFP1 57-NZ) and PS1641 (NtermE[F64L]YFP1 72-NZ). The 
introduced mutations were: T65G:V68L:S72A. Accordingly, the expressed NtermEYFP 
fragments have the following amino acid sequences (only residues 64-72 are shown): 





64 


65 


66 


67 


68 


69 


70 


71 


72 


NtermEGFP 
(template) 


L 


T 


Y 


G 


V 


Q 


C 


F 


S 


NtermEYFP 

(L64F:T65G:V68L:S72A) 


F 


G 


Y 


G 


L 


Q 


C 


F 


A 


NtermE[F64L]YFP 
(T65G:V68L:S72A) 


L 


G 


Y 


G 


L 


Q 


C 


F 


A 



25 Finally, primers 2337 and 2338 were used to convert expression vectors PS1557 (CZ- 
CtermEGFP158) and PS1558 (CZ-CtermEGFP173) into C-terminal EYFP fragment 
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expression vectors PS1637 (CZ-CtermEYFP158) and PS1638 (CZ-CtermEYFP173) by 
introducing a T203Y mutation. All sequences were verified by DNA sequencing of the 
vectors and all primer sequences are shown in Table 2. 



Example 9: EYFP based reassembly in mammalian cells 

5 The constructed EYFP based split fluorescent protein expression vectors PS1637 to 
PS1642 described above were investigated in mammalian cells in parallel with the EGFP 
based split fluorescent protein expression vectors PS1557 to PS1560 described in 
Example 7 and using the same experimental set-up (including the same filter set) and 
procedures (including the image analysis procedure) except that all images were 
1 0 produced using 1 0 ms exposure times instead of 50 ms exposure times, because of the 
increased brightness of the probes, and a 20x objective was used instead of a 40x 
objective to image more cells. Other appropriate filter sets could have been used. The 
images are taken the day after transfection (day 1 ). 

It is apparent from the identically scaled fluorescence images of the transfected cells 
15 (Figure 6) that the split site between residues 172 and 173 is again shown to be superior 
to the split site between residues 157 and 158. Furthermore, it is apparent that 
complementation based on EYFP fragments is superior to complementation based on 
EGFP fragments. Surprisingly, introduction of the F64L mutation from EGFP into the N- . 
terminal EYFP fragments further greatly enhanced the fluorescence of the complementing 
20 fragments. As can be seen from the images, the positive effects of using the optimal 
splitting site (between residues 172 and 173) using the optimal fluorescent protein colour 
variant (EYFP) and introducing the F64L folding mutation into the NtermEYFP fragment, 
are additive. Quantification of these observation was done by analysing the images shown 
in Figure 6 and the numeric out-put is presented in Figure 7. 

25 Effects of colour (yellow better): 



Good 




Better 


EGFP 


vs 


EYFP 


NtermEGFP157-NZ + 


vs 


NtermEYFPI 57-NZ + 


CZ-CtermEGFP158 




CZ-CtermEYFP158 


NtermEGFP172-NZ + 


vs 


NtermEYFPI 72-NZ + 


CZ-CtermEGFP173 




CZ-CtermEYFP173 
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Effects of split site (172/173 better): 



Good 




Better 


NtermEGFP157-NZ + 
CZ-CtermEGFP158 


vs 


NtermEGFP172-NZ + 
CZ-CtermEGFP173 


NtermEYFP157-NZ + 
CZ-CtermEYFP158 


vs 


NtermEYFPI 72-NZ + 
CZ-CtermEYFP173 


NtermE[F64L]YFP1 57-NZ + 
CZ-CtermEYFP158 


vs 


NtermE[F64L]YFP1 72-NZ + 
CZ-CtermEYFP172 



Effects of F64L (+F64L better): 



Good 


Better 


Nterm EYFP1 57-NZ + 


vs NtermE[F64L]YFP1 57-NZ + 


CZ-CtermEYFP158 


CZ-CtermEYFP158 


NtermEYFPI 72-NZ + 


vs Nterm E[F64L] YFP 1 72-NZ + 


CZ-CtermEYFP173 


CZ-CtermEYFP173 



5 It is interesting to note, that the optimal constructs. (NtermE[F64L]YFP1 72-NZ and CZ- 
CtermE[F64L]YFP173) when re-assembled is nearly as intense as EYFP itself. The great 
increase in fluorescence intensity is important in many types of quantitative cell analyses 
(e.g. high through-put screening and microscopy) to increase the signal to noise rations, 
to facilitate detection of low amounts of probes in vivo or in vitro, etc. 



10 Mixing NtermEYFP with CtermEGFP or NtermEGFP with CtermEYFP fragments can also 
produce functional fluorescent complexes, potentially of different colours (Figs. 8 and 9). 
Fragments having overlapping sequences are also functional and may be very attractive 
in e.g. functional cloning systems where highly flexible linkers sequences are required due 
to the very diverse nature of the fusion partners. The overlapping fragments permit either 

15 of the fusion partners to have a long linker sequence (Figure 8, quantified in Figure 9). 

Example 10: Construction of PS1 769,1 767,1 771 ,1768 

Plasmid PS1769 encodes a fusion of NtermE[F64L]YFP1 72 and FKBP, connected by a 
linker sequence GSGSGSGDITSLYKKAGST (1 letter amino acid code, SEQ ID NO: 11) 
derived in part from the Gateway recombination sequence. 
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Plasmid PS1767 encodes a fusion of NtermE[F64L]YFP1 72 and the FKBP binding part of 
FRAP, FRB (amino acids 2025-21 14 of FRAP), connected by a linker sequence 
GSGSGSGDITSLYKKAGST (1 letter amino acid code, SEQ ID NO: 12) derived in part 
from the Gateway recombination sequence. 

5 Plasmid PS1771 encodes a fusion FRB and CtermEYFP173, connected by a linker 
sequence DPAFLYKWISGSGSGSG (1 letter amino acid code, SEQ ID NO: 13) derived 
in part from the Gateway recombination sequence. 

Plasmid PS1768 encodes a fusion of FKBP and CtermEYFP173, connected by a linker 
sequence DPAFLYKWISGSGSGSG (1 letter amino acid, code, SEQ ID NO: 14) derived 
10 in part from the Gateway recombination sequence. 

Construction of plasmid PS 1 769. 

Plasmid PS1769 encodes a fusion of NtermE[F64l_]YFP172 and FKBP, connected by a 
linker sequence, under the control of a CMV promoter and with kanamycin and neomycin 
resistance as selectable marker in E.coli and mammalian cells, respectively. 

15 Plasmid PS1769 was derived from plasmids PS1779 (entry clone) and PS1679 

(destination vector). Plasmid PS1679 was derived from plasmids PS1672 and pEGFP- 
Cl(Clontech). Plasmid PS1672 was derived from plasmid PS1641 described above. 



Construction of intermediate PS1672. 

PS1641 was subjected to PCR with primers 2219 and 2222 (Table 2), and the ca 0.5 kb 
20 Nhe1-BamH1 fragment was ligated into pEGFP-C1 (Clontech) digested with Nhe1 and 
BamH1 . This replaces NtermEGFP with NtermE[F64L]YFP1 72 followed by a linker 
sequence, which encodes in frame linker sequence Gly-Ser-Gly-Ser-Gly-Ser-Gly, and a 
unique EcoRV site just upstream of BamH1. This plasmid is called PS1672. 



Construction of destination vector PS1 679. 
25 Plasmid PS1 672 was converted into a Gateway compatible destination vector by cutting 
the DNA with EcoRV and ligating it with Gateway Cassette reading frame A, following the 
recommendations of the Gateway manufacturer (Invitrogen). This destination vector is 
called PS1679. 
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Construction of Gateway entry clone PS1 779. 

The coding sequence of FKBP (GenBank Acc no XMJD16660) was isolated from human 
cDNA using PCR and primers 2442 and 1272 (Table 2). The ca 0.4 kb product was 
transferred by a BP reaction into donor vector pDONR207 f following the manufacturers 
5 recommendations (Invitrogen), to produce entry clone PS1779. 

Finally, the expression vector PS1769 was produced by transferring FKBP from entry 
clone PS1 779 with an LR reaction into destination vector PS1 679 following the 
manufacturers recommendations (Invitrogen), 

Construction of plasmid PS1 767. 

10 Plasmid PS1767 encodes a fusion of NtermE[F64L]YFP1 72 and the FKBP binding part of 
FRAP, FRB (amino acids 2025-2114 of FRAP), connected by a linker sequence, under 
the control of a CMV promoter and with kanamycin and neomycin resistance as 
selectable marker in E.coli and mammalian cells, respectively. 

Plasmid PS1767 was derived from plasmids PS1781 (entry clone) and PS1679 
15 (destination vector). Plasmid PS1679 was constructed as described above. 

Construction of Gateway entry clone PS1781. 

The FKBP binding part of FRAP (amino acids 2025-21 14, Gen Bank Acc no XM__001528) 
was isolated from human cDNA using PCR and primers 2444 and 1268 (Table 2). The ca 
0.3 kb product was transferred by a BP reaction into donor vector pDONR207, following 
20 the manufacturers recommendations (Invitrogen), to produce entry clone PS1781 . 

Finally, the expression vector PS1767 was produced by transferring FRB from entry clone 
PS1781 with an LR reaction into destination vector PS1679 following the manufacturers 
recommendations (Invitrogen). 

Construction of plasmid PS1771. 

25 Plasmid PS1771 encodes a fusion of the FKBP binding part of FRAP called FRB (amino 
acids 2025-21 14 of FRAP) and the C-terminal of EYFP (FRB-CtermEYFP1 73), connected 
by a linker sequence, under the control of a CMV promoter and with zeocin resistance as 
selectable marker in E.coli and mammalian cells. 
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Plasmid PS1771 was derived from plasmids PS1782 (entry clone) and PS1688 
(destination vector). Plasmid PS1688 was derived from plasmids PS1674 and PS609 
described above. Plasmid PS1674 was derived from plasmid PS1638 described above. 

Construction of intermediate PS1674. 
5 PS1638 was subjected to PGR with primers 2225 and 2132 (Table 2), and the ca 0.25 kb 
Nhe1-BamH1 fragment was ligated into PS609 digested with Nhe1 and BamH1. This 
replaces EGFP with EYFP(1 73-238) preceded by a linker sequence, which encodes in 
frame linker sequence Gly-Ser-Gly-Ser-Gly-Ser-Gly, and a unique EcoRV site just 
downstream of Nhe1 . This plasmid is called PS 1674. 

10 Construction of destination vector PS1 688. 

Plasmid PS1674 was converted into a Gateway compatible destination vector by cutting 
the DNA with EcoRV and ligating it with Gateway Cassette reading frame A, following the 
recommendations of the Gateway manufacturer (Invitrogen). This destination vector is 
called PS1688. 

1 5 Construction of Gateway entry clone PS 1 782. 

The FKBP binding part of FRAP (GenBank Acc no XIVT001528, amino acids 2025-21 14) 
was isolated from human cDNA using PCR and primers 2444 and 2445 (Table 2). The ca 
0.3 kb product was transferred by a BP reaction into donor vector pDONR207, following 
the manufacturers recommendations (Invitrogen), to produce entry clone PS1782. 

20 Finally, the expression vector PS1768 was produced by transferring FRB from entry clone 
PS1782 with an LR reaction into destination vector PS1688 following the manufacturers 
recommendations (Invitrogen). 

Construction of plasmid PS 1 768. 

Plasmid PS1768 encodes a fusion of FKBP and EYFP(1 73-238) (FKBP-CtermEYFP1 73), 
25 under the control of a CMV promoter and with zeocin resistance as selectable marker in 
E.coli and mammalian cells. 



Plasmid PS1768 was derived from plasmids PS1780 (entry clone) and PS1688 
(destination vector). Plasmid PS1688 was constructed as described above. 
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Construction of Gateway entry clone PS1780. 

The coding sequence of FKBP (GenBank Acc no XMJ316660) was isolated from human 
cDNA using PCR and primers 2442 and 2443 (Table 2). The ca 0.4 kb product was 
transferred by a BP reaction into donor vector pDONR207, following the manufacturers 
5 recommendations (Invitrogen), to produce entry clone PS1780. 

Finally, the expression vector PS1768 was produced by transferring FKBP from entry 
clone PS1780 with an LR reaction into destination vector PS 1688 following the 
manufacturers recommendations (Invitrogen). 

Example 11: Construction of an inducible interaction system using the GFP 
1 0 complementation method that demonstrates utility of the method in 
screening for compounds that inhibit protein-protein interactions. 

The immunosuppressive compound rapamycin binds to FK506 binding protein (FKBP) 
and simultaneously to the large PI3Kinase homolog FRAP (also known as mTOR or 
RAFT), and thus serves as an heterodimeriser compound for these two proteins. To use 

15 rapamycin to induce heterodimers between proteins of interest, one of the proteins is 
fused to FKBP domains, and the other to a 90 amino acid portion of FRAP, termed FRB, 
that is sufficient for the binding the FKBP-rapamycin complex (Chen et a/, PNAS 92, 4947 
(1995)). In this example fusions of FRB and FKBP were made to complementary halves 
of split-EYFP (which included the F64L mutation in the EYFP(1-172) sequence 

20 (NtermE[F64L]YFP1 72)), so that the complementation reaction could be controlled by 
addition of rapamycin. 

This example demonstrates that a model GFP complementation system using 
components which can be made to interact conditionally does respond as expected in a 
dose-dependent manner to the interaction stimulus. The example also provides 
25 information about the rate of fluorescence development for the E[F64L]YFP 

complementation system. Further it demonstrates that the system can be used to detect 
compounds that will block the interaction of proteins fused to the complementary halves of 
the E[F64L]YFP complementation system. 



The following fusion constructs were made as described in Example 10: 
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NtermE[F64L]YFP1 72-FKBP = plasmid code PS1769 
FRB-CtermEYFP173 = PS1771 
NtermE[F64L]YFP172-FRB = PS1767 
FKBP-CtermEYFP173 = PS1768 



5 Probes were co-transfected in pairs into CHO-hIR cells (supra), PS1769 with PS1771 and 
PS1767 with PS1768, using the transfection agent FuGENE™ 6 (Boehringer Mannheim 
Corp, USA) according to the method recommended by the suppliers. Cells were cultured 
in growth medium (HAM's F12 nutrient mix with Glutamax-1, 10 % foetal bovine serum 
(FBS), 100 \xg penicillin-streptomycin mixture ml" 1 (GibcoBRL, supplied by Life 
10 Technologies, Denmark)). Transfected cells were cultured in this medium, with the 
addition of two selection agents appropriate to the plasmids being used, being 1 mg/ml 
zeocin plus 0.5 mg/mlG418 sulphate. Cells were cultured at 37°C in 100% humidity and 
conditions of normal atmospheric gases supplemented with 5% CG 2 . 

After 10 to 12 days culture in the continuous presence of the selection agents, the 
15 resulting cell lines were judged to be stably transfected. For fluorescence microscopy, 
aliquots of cells were transferred to Lab-Tek chambered cover glasses (Nalge Nunc 
International, Naperville USA) and allowed to adhere for at least 24 hours to reach about 
80% confluence. Images were routinely collected using a Nikon Diaphot 300 inverted 
fluorescence microscope (Nikon Corp., Tokyo, Japan) using x20 (dry) and/or x40 (oil 
20 immersion) objectives and coupled to a Orca ER charged coupled device (CCD) camera 
(Hammamatsu Photonics K.K., Hammamatsu City, Japan). The cells are illuminated with 
a 100 W HBO arc lamp via a 470±20 nm excitation filter, a 510 nm dichroic mirror and a 
515±15 nm emission filter for minimal image background. Image collection, subsequent 
measurement and analysis of fluorescence intensity were all controlled by IPLab 
25 Spectrum for Windows software (Scanalytics, Fairfax, VA USA). 



Cells were also grown for 16 hours from a seeding density of approximately 1.0 x 10 5 cells 
per 400 |jL in plastic 96-well plates (Polyfiltronics Packard 96-View Plate or Costar Black 
Plate, clear bottom; both types tissue culture treated) both for imaging purposes and for 
measurements of fluorescence intensity in fluorescence plate readers. Prior to 
30 experiments, the cells are cultured over night without selection agent(s) in HAM.F-12 
medium with glutamax, 100 \xg penicillin-streptomycin mixture ml" 1 and 10 % FBS. This 
medium has low auto fluorescence enabling fluorescence measurements on cells straight 
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from the incubator. For endpoint measurements, cells in plates were routinely fixed with 
4% formaldehyde in phosphate buffered saline (PBS) + 10 jaM Hoechst 22538 for 10 
minutes, followed by 3 wash steps using PBS. The use of the nuclear dye Hoechst 22538 
enables correction of the EYFP fluorescence signal from each well for cell density. Plates 
5 prepared in this way were measured on a Fluoroskan Ascent CF plate reader 

(Labsystems, Finland) equipped with appropriate filter sets (EYFP: excitation 485 nm, 
emission 527 nm; Hoechst 22538: excitation 355 nM, emission 460 nm). 

Both cell lines CHO-hIR [PS1769 + PS1771] and CHO-hIR [PS1767 + PS1768] 
responded to rapamycin with a substantial increase in EYFP fluorescence after several 

1 0 hours incubation, as expected (Figure 1 0). At the starting condition for these cells (t=0), 
fluorescence is barely visible in most cells, although it was noted that some cells (< 5%) in 
the population had low, but appreciable, fluorescence before treatment (Figure 10a). After 
4 hours (Figure 10b) many cells (approximately 40%) had developed significantly greater 
EYFP fluorescence throughout the cytoplasmic and nuclear compartments^ After 16 hours 

15 (Figure 10c) the response per cell had increased further and encompassed a larger 
proportion of the cell population (approximately 70%). Results were essentially identical 
for the second cell line CHO-hIR [PS1769 + PS1771]. 

The graph in Figure 1 1 shows the rate of development of cellular EYFP fluorescence 
following rapamycin treatment of the CHO-hIR [PS1767 + PS1768] line. Cells were 

20 treated in 96-well plates with 3 p.M rapamycin and the fluorescence measured at various 
times. Treatment and measurements were made with the cells growing in HAM's medium 
+ 10% FBS, and fluorescence measurements were corrected for the background 
fluorescence from this medium. The graph demonstrates that the half-time for 
development of fluorescence is approximately 5 hours. The rate of development of 

25 fluorescence includes time for interaction between FKBP and FRB mediated by the 
dimeriser rapamycin, plus the time for annealing of the EYFP moieties, and the 
(presumably much longer) time needed for maturation of the fluorophore within the 
successfully annealed EYFP protein. 

Figure 12 is a response curve to different rapamycin doses for the CHO-hIR [PS1769 + 
30 PS1771] cell line. Cells were cultured in 96-well plates, treated with various rapamycin 
doses for 16 hours, then fixed and stained with Hoechst prior to determination of EYFP 
fluorescence/cell (arbitrary units) on the Ascent plate reader. Values are corrected for 
PBS background as well as cell number. The cell line shows approximately a 3-fold 
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increase in the EYFP intensity/cell over the dose range of rapamycin used in this 
experiment. 

One way to increase the dynamic range of the response, and to decrease the inherent 
EYFP background signal from these cell lines, is to remove the fraction of cells that are 
5 EYFP bright prior to rapamycin stimulation. This is easily accomplished through 

fluorescence activated cell sorting (FACS) methods. Each of the cell lines were sorted by 
this method into 3 groups: (i) most green group (ii) medium to low-green group and (iii) 
black group. The 'most green' was discarded in each case, while the other 2 groups were 
cultured for further use. Figure 13 (a) and Figure 13 (b) show the improved response to 
10 1 00 nM rapamycin of cell line CHO-hIR [PS1 767 + PS1 768] after the sorting procedure. 

Figure 14 (a) and (b) show the response of the 'medium to low-green 1 and 'black' FACS 
groups (respectively) derived from the CHO-hIR [PS1767 + PS1768] parent line. Dose 
response to rapamycin was measured after 7 hours (a) and 30 hours (b) for each cell line. 
Values for fluorescence have been corrected for plate & medium background. Increase in 

15 EYFP fluorescence is better than 20-fold the unstimulated value in each case. 

Unexpectedly, the absolute fluorescence signal does not appear to change significantly 
between 7 and 30 hours, although the cells are still alive during this period. Furthermore, 
the dose-response curves at 7 and 30 hours for each cell line are very closely similar, with 
an EC 50 of approximately 0.25 j^M in the 'medium to low-green' group, and 0.1 jaM in the 

20 'black 1 group. This data suggest that once the dimerisation has occurred, the EYFP 
complements are stable within the cells for longer than 30 hours. The 'medium to low- 
green' group has a greater overall response range, reaching intensities of greater than 3- 
fold that of the black group at the highest rapamycin concentration. Both FACS groups 
have significantly lower pre-stimulation fluorescence intensities compared to the parent 

25 (non-FACS'd) lines. 

Figure 15 (a) and (b) show dose-response competition curves for FK506 versus 100 nM 
rapamycin in two of the FACS'd lines, CHO-hIR [PS1768 + PS1767] 'mid to low-green' 
group (Figure 15 (a)) and CHO-hIR [PS1769 + PS1771] 'black' group (Figure 15 (b)). EC 50 
values in both cases are approximately 1.2 jiM FK506. The cells were incubated 
30 overnight (16 hours) with mixtures of the two compounds, then fixed and stained with 
Hoechst prior to detemination of EYFP fluorescence/cell on an Ascent plate reader. Plate 
and solution backgrounds have been subtracted; the dashed lines on each graph indicate 
the prestimulated fluorescence levels for each cell line in these experiments. These 
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results indicate that the GFP complementation method employing fusions to 
NtermE[F64L]YFP172 and CtermEYFP173 may be used successfully to screen for 
compounds that interfere with a conditional interaction between two protein components. 

Example 12: Construction ofPS1736, PS1758, PS1706, PS1810, PS1809, and 
5 PS1827 

Plasmid PS1736 encodes a fusion of F64L.NY172-NZ and H2B, connected by a linker 
sequence GSGSGSGDITSLYKKAGST (1 letter amino acid code, SEQ ID NO: 74) derived 
in part from the Gateway recombination sequence. 

Plasmid PS1758 encodes a fusion of beta-Catenin and CZ-CY173, connected by a linker 
10 sequence DPAFLYKWISGSGSGSG (1 letter amino acid code, SEQ iD NO: 69) derived 
in part from the Gateway recombination sequence. 

Plasmid PS1706 encodes a fusion of F64L,NY172 (=NtermE[F64L]YFP1 72) and beta- 
Catenin, connected by a linker sequence GSGSGSGDITSLYKKAGST (1 letter amino acid 
code, SEQ ID NO: 70) derived in part from the Gateway recombination sequence. 

15 Plasmid PS1810 encodes a fusion of H2B and TCF4(1-70) and CY173 (CtermEYFP173), 
where H2B and TCF4(1-70) are connected by a linker sequence DITSLYKKAGST (SEQ 
ID NO: 75) derived in part from the Gateway recombination sequence, and where 
TCF4(1-70) and CY173 (CtermEYFP173 ) are connected by a linker sequence. 
DPAFLYKWISGSGSG (1 letter amino acid code, SEQ ID NO: 71) derived in part from 

20 the Gateway recombination sequence. 

Plasmid PS1809 encodes a fusion of F64L,NY172 (NtermE[F64L]YFP1 72) and FKBP and 
beta-Catenin, where F64L,NY172 (NtermE[F64L]YFP172) and FKBP are connected by a 
linker sequence GSGSGSGDL (1 letter amino acid code, SEQ ID NO: 72) and where 
FKBP and beta-Catenin are connected by a linker sequence GTGTGTGDITSLYKKAGST 
25 (SEQ ID NO: 76) derived in part from the Gateway recombination sequence. 

Plasmid PS 1827 encodes a fusion of H2B and the FKBP binding part of FRAP (amino 
acids 2025-2114) and CY173 (CtermEYFP173), where H2B and the FKBP binding part of 
FRAP (amino acids 2025-21 14)are connected by a linker sequence 
DPAFLYKWISGTGTGTG (1 letter amino acid code, SEQ ID NO: 73) derived in part from 
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the Gateway recombination sequence, and where the FKBP binding part of FRAP (amino 
acids 2025-21 14) and CY173 (CtermEYFP173) are connected by a linker sequence 
LPSGSGSGSG (SEQ ID NO: 77). 

Construction of plasmid PS1736. 

5 Plasmid PS1736 encodes a fusion of F64L,NY172-NZ and H2B, connected by a linker 
sequence, under the control of a CMV promoter and with kanamycin and neomycin 
resistance as selectable marker in E.coli and mammalian cells, respectively. 

Plasmid PS1736 was derived from plasmids PS1667 (entry clone) and PS1695 
(destination vector). Plasmid PS1695 was derived from plasmid PS1673. Plasmid PS1673 
10 was derived from plasmids PS1641 described above and pEGFP-CI(Clontech). Plasmid 
PS1672 was derived from plasmid PS1641 described above. 

Construction of intermediate PS1673. 

PS1641 was subjected to PCR with primers 2219 and 2387 (table 2), and the ca 0.5 kb 
Nhe1-BamH1 fragment was ligated into pEGFP-C1 (Clontech) digested with Nhe1 and 
15 BamH1 . This replaces EGFP with F64L,EYFP(1-172)followed by a linker sequence, which 
encodes in frame linker sequence GGTGSG-NZ-GSGSGSG, and a unique EcoRV site 
just upstream of BamH1. This plasmid is called PS1673. 

Construction of destination vector PS1 695. 

Plasmid PS1673 was converted into a Gateway compatible destination vector by cutting 
20 the DNA with EcoRV and ligating it with Gateway Cassette reading frame A, following the 
recommendations of the Gateway manufacturer (Invitrogen). This destination vector is 
called PS1695. 

Construction of Gateway entry clone PS1667. . 

The coding sequence of H2B (GenBank Acc no NM_003518) was isolated from human 
25 cDNA using PCR and primers 2367 and 2368 (Table 2). The ca 0.4 kb product was 
transferred by a BP reaction into donor vector pDONR207, following the manufacturers 
recommendations (Invitrogen), to produce entry clone PS 1667 

Finally, the expression vector PS1736 was produced by transferring H2B from entry clone 
PS1667 with an LR reaction into destination vector PS1695 following the manufacturers 



WO 03/089627 



CT/DK03/00266 



60 



Construction of plasm id PS1 758. 

Plasmid PS1758 encodes a fusion of beta-Catenin and CZ-CY173, connected by a linker 
sequence, under the control of a CMV promoter and with zeocin resistance as selectable 
marker in E.coli and mammalian cells. 

5 Plasmid PS1771 was derived from plasmids PS1762 (entry clone) and PS1696 

(destination vector). Plasmid PS1696 was derived from plasmid PS1726. Plasmid PS1726 
was derived from plasmids PS1638 and PS609 described above. 

Construction of intermediate PS1 726. 

PS1638 was subjected to PCR with primers 2388 and 2132 (table 2), and the ca 0.35 kb 
10 Nhe1-BamH1 fragment was ligated into PS609 digested with Nhe1 apd BamH1. This 
replaces EGFP with EYFP(1 73-238)preceded by a linker sequence, which encodes in 
frame linker sequence GSGSGSG-zip, and a unique EcoRV site just downstream of 
Nhe1. This plasmid is called PS1726. 

Construction of destination vector PS1 696. 
15 Plasmid PS1726 was converted into a Gateway compatible destination vector by cutting 
the DNA with EcoRV and ligating it with Gateway Cassette reading frame A, following the 
recommendations of the Gateway manufacturer (Invitrogen). This destination vector is 
called PS 1696. 

Construction of Gateway entry clone PS 1 762. 
20 The coding sequence of beta-Catenin(GenBank Acc no X87838, was isolated from 
human cDNA using PCR and primers 2359 and 2439 (Table 2). The ca 2.3 kb product 
was transferred by a BP reaction into donor vector pDONR207, following the 
manufacturers recommendations (Invitrogen), to produce entry clone PS1762. 



25 



Finally, the expression vector PS1758 was produced by transferring the coding sequence 
of beta-Catenin from entry clone PS1762 with an LR reaction into destination vector 
PS1696 following the manufacturers recommendations (Invitrogen). 
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Construction of plasm id PS1706. 

Plasmid PS1706 encodes a fusion of F64L,NY172 and beta-Catenin, connected by a 
linker sequence, under the control of a CMV promoter and with kanamycin and neomycin 
resistance as selectable marker in E.coli and mammalian cells, respectively. 

5 Plasmid PS1706 was derived from plasmids PS1664 (entry clone) and PS1679 
(destination vector). Plasmid PS1679 is described above. 

Construction of Gateway entry clone PS 1664. 

The coding sequence of beta-Catenin (GenBank Acc no X87838) was isolated from 
human cDNA using PCR and primers 2359 and 2360 (table 2). The ca 2.3 kb product was 
10 transferred by a BP reaction into donor vector pDONR207, following the manufacturers 



Finally, the expression vector PS1706 was produced by transferring beta-Catenin from 
entry clone PS1664 with an LR reaction into destination vector PS1679 following the 
manufacturers recommendations (Invitrogen). 

1 5 Construction of plasmid PS1810. 

Plasmid PS1810 encodes a fusion of H2B-TCF4(1-70) and CY173, connected by a linker 
sequence, under the control of a CMV promoter and with zeocin resistance as selectable 
marker in E.coli and mammalian cells. 

Plasmid PS1810 was derived from plasmids PS1669 (entry clone) and PS1806 
20 (destination vector). Plasmid PS1806 was derived from plasmid PS1790. Plasmid PS1790 
was derived from plasmid PS1674 described above. 

Construction of intermediate PS1790. 

The coding sequence of H2B (GenBank Acc no NMJD03518) was isolated from human 
cDNA with primers 2389 and 2390 (Table 2), and the ca 0.4 kb Nhe1 -EcoRV fragment 
25 was ligated into PS1674 digested with Nhe1 and EcoRV. This places H2B upstream of 
EYFP(1 73-238), which is preceded by a linker sequence, which encodes in frame linker 
sequence GSGSGSG-zip, and a unique EcoRV site between H2B and the linker- 
EYFP(1 73-238). This plasmid is called PS1790. 



recommendations (Invitrogen), to produce entry clone PS1664. 
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Construction of destination vector PS1 806. 

Plasmid PS1790 was converted into a Gateway compatible destination vector by cutting 
the DNA with EcoRV and ligating it with Gateway Cassette reading frame A, following the 
recommendations of the Gateway manufacturer (Invitrogen). This destination vector is 
5 called PS1 806. 

Construction of Gateway entry clone PS1669. 

The coding sequence of the N-terminal 70 amino acids of TCF4 (GenBank Acc no 
Y1 1306), was isolated from human cDNA using PCR and primers 2364 and 2366 
described below. The ca 0.25 kb product was transferred by a BP reaction into donor 
10 vector pDONR207, following the manufacturers recommendations (Invitrogen), to produce 
entry clone PS1 669. 

Finally, the expression vector PS1810 was produced by transferring the coding sequence 
of TCF4(1-70) from entry clone PS1669 with an LR reaction into destination vector 
PS1806 following the manufacturers recommendations (Invitrogen). 

15 Construction of plasmid PS1809. 

Plasmid PS1809 encodes a fusion of F64L,NY172 and FKBP-betaCatenin, connected by 
a linker sequence, under the control of a CMV promoter and with kanamycin and 
neomycin resistance as selectable marker in E.coli and mammalian cells, respectively. 

Plasmid PS1809 was derived from plasmids PS1664 (entry clone)described above and 
20 PS1805 (destination vector). Plasmid PS1805 was derived from plasmid PS1789. Plasmid 
PS1789 was derived from plasmids PS1672 described above and PS1768 described 
above. 

Construction of intermediate PS1 789. 

PS1768 was subjected to PCR with primers 2462 and 2463, and the ca 0.35 kb fragment 
25 was digested with Pvu2 and BamH1 and ligated into PS1672 digested with EcoRV and 
BamH1. This produces a fusion of F64L,NY172 and FKBP connected by a GSGSGSGDL 
linker and with a GTGTGTG linker sequence behind FKBP and a. unique EcoRV site just 
downstream of the GT3 linker. This plasmid is called PS1789. 
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Construction of destination vector PS1805. 

Plasmid PS1789 was converted into a Gateway compatible destination vector by cutting 
the DNA with EcoRV and ligating it with Gateway Cassette reading frame A, following the 
recommendations of the Gateway manufacturer (Invitrogen). This destination vector is 
5 called PS1 805. 

Finally, the expression vector PS1809 was produced by transferring beta-Catenin from 
entry clone PS1664 with an LR reaction into destination vector PS1805 following the 
manufacturers recommendations (Invitrogen). 

Construction of plasmid PS1 827. 

10 Plasmid PS1827 encodes a fusion of H2B and the FKBP binding part of FRAP (amino 
acids 2025-21 14)and CY173, connected by a linker sequence, under the control of a CMV 
promoter and with zeocin resistance as selectable marker in E.coli and mammalian cells. 

Plasmid PS1827 was derived from plasmids PS1732 (entry clone) and PS1826 
(destination vector). Plasmid PS1826 was derived from plasmid PS1788. Plasmid PS1788 
15 was derived from plasmids PS1674 and PS1767 described above. 

Construction of intermediate PS1 788. 

PS1767 was subjected to PCR with primers 2464 and 2465, and the ca 0.35 kb product 
was digested with Nhe1-Sma1 and ligated into PS1674 digested with Nhe1 and EcoRV. 
This produces a fusion of FRAP(2025-2114) and EYFP(1 73-238) connected by a linker 
20 sequence, GSGSGSG, and with a GTGTGTG linker sequence in front of. FRAP, and a 
unique EcoRV site just upstream of the GTGTGTG linker. This plasmid is called PS1788. 

Construction of destination vector PS 1826. 

Plasmid PS1788 was converted into a Gateway compatible destination vector by cutting 
the DNA with EcoRV and ligating it with Gateway Cassette reading frame A, following the 
25 recommendations of the Gateway manufacturer (Invitrogen). This destination vector is 
called PS1826. 

Construction of Gateway entry clone PS1732. 

The coding sequence of H2B (GenBank Acc no NM__003518) was isolated from human 
cDNA using PCR and primers 2367 and 2407 (Table 2). The ca 0.4 kb product was 
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transferred by a BP reaction into donor vector pDONR207, following the manufacturers 
recommendations (Invitrogen), to produce entry clone PS1 732 

Finally, the expression vector PS1827 was produced by transferring H2B from entry clone 
PS1732 with an LR reaction into destination vector PS1826 following the manufacturers 
5 recommendations (Invitrogen). 



Example 13: Assay for screening in live cells for compounds that modify the 
translocation behaviours of one or both members of a pair of target 
proteins and/or the interaction between those proteins. 

The present example shows the construction of a protein translocation assay using the 
10 GFP complementation method that demonstrates utility of the method in screening in live 
cells for compounds that modify the translocation behaviours of one or both members of 
a pair of target proteins and/or the interaction between those proteins (Translocation and 
Interaction Dependent Complementation, TIDC). 



This example demonstrates that GFP complementation systems can be designed to 
15 report on the degree of interaction between one protein and a partner protein within living 
cells, specifically where such an interaction can only take place after translocation of one 
or both proteins has occurred that serves to bring both components together in one 
compartment or location within the cell. A fluorescence signal is obtained that 
corresponds with the degree of interaction that occurs between the partner proteins, 
20 usually following a stimulus that induces the appropriate translocation. Such an assay 
system can be used to screen for compounds or treatments that will modify either the 
translocation process(es) involved or the interaction between the target proteins used in 
construction of the complementation probes. 

p-catenin is a multifunctional protein that plays a key role in the development and 
25 progression of certain human cancers, most notably cancer of the colon. In a non-dividing 
colonic epithelial cell p-catenin is found in complexes with cell adhesion molecules at the 
plasma membrane, and also in complex with a number of proteins (Axin, Adenomatous 
polyposis coli protein, or APC, and the protein kinase GSK3P) that together control its 
targeted destruction by the cellular ubiquitination system, p-catenin also functions as an 
30 activator of transcription, for which function it needs to translocate to the nucleus and 
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dimerise with other pro-trariscriptional co-factors, the most significant of which in colonic 
cells is TCF4. The level of p-catenin in non-dividing cells is kept low by ubiquitination and 
subsequent destruction by proteosomes, and little or no p-catenin is found in the nucleus, 
and thus little or no p-catenin-induced gene transcription occurs. In cancerous and certain 
5 pre-cancerous colon epithelial cells, mutations in either p-catenin itself or in one of its 
regulatory proteins lead to accumulation of the protein in the cytoplasm, with subsequent 
translocation to the nucleus and constitutive activation of transcription from genes 
controlled by p-catenin:TCF4 factors. It is continuous expression of these genes that 
leads to the cancerous phenotype in such cells. 

10 In this example, complementary halves of GFP are fused separately , to p-catenin and also 
to histone H2B coupled to the p-catenin-binding fragment of the transcriptional co- 
activator TCF4. Both constructs are then co-transfected and expressed within mammalian 
cells. The p-catenin construct is expected to be localised entirely in the cell cytoplasm at a 
low level, its concentration effectively controlled by the activity of the protein kinase 

15 GSK3p. The H2B-TCF4 construct will be localised in the nuclei of cells by the highly basic 
histone moeity. Hence, the two components should not meet one another under normal 
growth conditions. However, when GSK3p is inhibited by lithium ions, the cytoplasmic 
level of the p-catenin construct is expected to increase in the cytoplasm, and to 
translocate in significant fashion to the nucleus. When the constructs find themselves 

20 mixed together in the nucleus as a result of p-catenin translocation, the natural interaction 
between the two proteins (p-catenin and TCF4) will bring the complementary GFP halves 
into close apposition, whereupon the annealing process should be initiated that will 
ultimately lead to development of fully fluorescent GFP molecules. The development of 
nuclear fluorescence in lithium-treated cells therefore effectively reports on the combined 

25 result of translocation of p-catenin to the nucleus and the subsequent interaction between 
P-catenin and the p-catenin binding domain of TCF4. 

The following fusion constructs were made (see Example 12): 
NtermE[F64L]YFP172.-yff-catenin = plasmid code ps1706 
H2B-TCF4(frag)- CtermEYFP173= ps1810 

30 Plasmids ps1706 with ps1810 were co-transfected into Chinese hamster ovary cells 
(CHO) using the transfection agent FuGENE™ 6 (Boehringer Mannheim Corp, USA) 
according to the method recommended by the suppliers. Cells were cultured in growth 
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medium (HAM's F12 nutrient mix with Glutamax-1, 10 % foetal bovine serum (FBS), 100 
jag penicillin-streptomycin mixture ml' 1 (GibcoBRL, supplied by Life Technologies, 
Denmark)). Transfected cells were cultured in this medium, with the addition of two 
selection agents appropriate to the plasmids being used, being 1 mg/ml zeocin plus 0.5 
5 mg/mlG41 8 sulphate. Cells were cultured at 37°C in 100% humidity and conditions of 
normal atmospheric gases supplemented with 5% C0 2 . 

After 10 to 12 days culture in the continuous presence of the selection agents, the 
resulting cell lines were judged to be stably transfected. For fluorescence microscopy, 
aliquots of cells were transferred to Lab-Tek chambered cover glasses (Nalge Nunc 
10 International, Naperville USA) and allowed to adhere for at least 24 hours to reach about 
80% confluence. Images were collected using a Zeiss LSM410 inverted confocal 
fluorescence microscope (Zeiss, Jena, Germany) using x40 (oil immersion) objective. 488 
nm laser light was used for excitation of GFP, emissions filtered via a FT510 dichroic and 
510-525 nm bandpass filter. 

1 5 After treatments, but prior to microscopy, cells were routinely fixed with 4% formaldehyde 
in phosphate buffered saline (PBS) for 10 minutes, followed by 3 wash steps using PBS. 

Figure 18 (A) shows the level of fluorescence in stable CHO[ps1706 + ps1810] prior to 
any treatment of the cells. The level of fluorescence is very weak in most cells and may 
be due entirely to cellular autofluorescence from flavanoids such as FAD (the emisbn 

20 filter used typically allows a fraction of FAD autofluorescence to pass). Occasional cells 
(top right corner of Figure 18 (A)) show a weak GFP fluorescence in nuclear and 
cytoplasmic compartments, apparently the result of spontaneous complementation. Most 
of these cells are found clustered, possibly indicating a clonal behaviour that may reflect 
incorrect processing of the introduced components, such that they are brought together 

25 into the same compartment during expression (or possibly cell division) where 

complementation can occur. However, when treated for 16 hours with 5 mM LiCI, the 
number of cells containing fluorescent GFP increases greatly, and many of these have the 
GFP fluorescence confined exclusively to their nuclei consistent with the distribution of 
GFP complementation expected from this pair of fusion constructs (Figure 18 (B)). 



30 



These results indicate that the GFP complementation method employing fusions to 
NtermE[F64L]YFP172and CtermEYFP173 of protein components that normally reside in 
separated cellular compartments or locations, can be used to monitor the protein-protein 
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interaction between those components that result from translocation processes bringing 
both components together within one compartment or at one shared location. A simple 
intensity or image-based fluorescence assay based on this principle may be used to 
screen for compounds that affect either the translocation processes relevant to specific 
5 cellular components or the interactions between such protein components in living 
mammalian cells. Used in combination with a similar assay designed to report solely on 
the translocation processes involved (e.g. an assay similar to that described in Example 
14 or Example 16), it will be possible to define whether any compound is specifically 
targeting translocation or interaction. 



10 Example 14: Protein translocation assay using GFP complementation 

This method that demonstrates utility of the method in screening in live cells for 
compounds that modify the translocation behaviour of a target protein (Translocation 
Dependent Complementation, TDCzip). 

This example demonstrates that GFP complementation systems can be designed to 
15 report specifically on the translocation behaviour of a target protein within living cells. A 
fluorescence signal is obtained that corresponds directly with the degree of translocation 
that occurs. Such an assay system can be used to screen for compounds or treatments 
that will modify the translocation behaviour of the target protein. 

p-catenin is a multifunctional protein that plays a key role in the development and 
20 progression of certain human cancers, most notably cancer of the colon. In a non-dividing 
colonic epithelial cell p-catenin is found in complexes with cell adhesion molecules at the 
plasma membrane, and also in complex with a number of proteins (Axin, Adenomatous 
polyposis coli protein, or APC, and the protein kinase GSK3P) that together control its 
targeted destruction by the cellular ubiquitination system, p-catenin also functions as an 
25 activator of transcription, for which function it needs to translocate to the nucleus and 
dimerise with other pro-transcriptional co-factors, the most significant of which in colonic 
cells is TCF4. The level of p-catenin in non-dividing cells is kept low by ubiquitination and 
subsequent destruction by proteosomes, and little or no p-catenin is found in the nucleus, 
and thus little or no p-catenin-induced gene transcription occurs. In cancerous and certain 
30 pre-cancerous colon epithelial cells, mutations in either p-catenin itself or in one of its 
regulatory proteins lead to accumulation of the protein in the cytoplasm, with subsequent 
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translocation to the nucleus and constitutive activation of transcription from genes 
controlled by p-catenin :TCF4 factors. It is continuous expression of these genes that 
leads to the cancerous phenotype in such cells. 

In this example, complementary halves of GFP are fused separately to p-catenin and to 
5 histone H2B and co-expressed within cells. The p-catenin construct is expected to be 
localised entirely in the cell cytoplasm at a low level, its concentration effectively 
controlled by the activity of the protein kinase GSK3p. The H2B construct will be localised 
in the nuclei of cells by the highly basic histone moeity. Hence, the two components 
should not meet one another under normal growth conditions. However, when GSK3p is 

10 inhibited by lithium ions, the cytoplasmic level of the p-catenin construct is expected to 
increase in the cytoplasm, and to translocate in significant fashion to the nucleus. Both 
constructs have been engineered to contain a leucine zipper domain. When the 
constructs find themselves mixed together in the nucleus as a result of p-catenin 
translocation, the leucine zippers will dimerise bringing the complementary GFP halves 

15 into close apposition, whereupon the annealing process is initiated that ultimately leads to 
development of fully fluorescent GFP molecules. The development of nuclear . 
fluorescence in lithium-treated cells therefore effectively reports on the translocation of p- 
catenin to the nucleus. 

The following fusion constructs were made (see Example 12): 
20 NtermE[F64L]YFP172-Zip-H2B = plasmid code ps 1736 
p-catenin~Zip-NtermE[F64L]YFP172 = ps1758 

Plasmids ps1736 with ps1758 were co-transfected into Chinese hamster ovary cells 
(CHO) using the transfection agent FuGENE™ 6 (Boehringer Mannheim Corp, USA) 
according to the method recommended by the suppliers. Cells were cultured in growth 

25 medium (HAM's F12 nutrient mix with Glutamax-1, 10 % foetal bovine serum (FBS), 100 
fig penicillin-streptomycin mixture ml" 1 (GibcoBRL, supplied by Life Technologies, 
Denmark)). Transfected cells were cultured in this medium, with the addition of two 
selection agents appropriate to the plasmids being used, being 1 mg/ml zeocin plus 0.5 
mg/mlG418 sulphate. Cells were cultured at 37°C in 100% humidity and conditions of 

30 normal atmospheric gases supplemented with 5% CQ 2 . 
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After 10 to 12 days culture in the continuous presence of the selection agents, the 
resulting cell lines were judged to be stably transfected. For fluorescence microscopy, 
aliquots of cells were transferred to Lab-Tek chambered cover glasses (Nalge Nunc 
International, Naperville USA) and allowed to adhere for at least 24 hours to reach about 
5 80% confluence. Images were routinely collected using a Nikon Diaphot 300 inverted 
fluorescence microscope (Nikon Corp., Tokyo, Japan) using x20 (dry) and/or x40 (oil 
immersion) objectives and coupled to a Orca ER charged coupled device (CCD) camera 
(Hammamatsu Photonics K.K., Hammamatsu City, Japan). The cells are illuminated with 
a 1 00 W HBO arc lamp via a 470±20 nm excitation filter, a 510 nm dichroic mirror and a 
10 515±15 nm emission filter for minimal image background. Image collection, subsequent 
measurement and analysis of fluorescence intensity were all controlled by IPLab 
Spectrum for Windows software (Scanalytics, Fairfax, VA USA). 

After treatments, cells were routinely fixed with 4% formaldehyde in phosphate buffered 
saline (PBS) for 10 minutes, followed by 3 wash steps using PBS. 

15 Figure 19 (A) shows the level of fluorescence in stable CHO[ps1736 + ps1758] prior to 
any treatment of the cells. The level of fluorescence is very weak and may be due entirely 
to cellular autofluorescence from flavanoids such as FAD (the emision filter used typically 
allows a fraction of FAD autofluorescence to pass). The distribution of this fluorescence is 
also consistent with FAD autofluorescence, it being principally in the cytoplasm, with a 

20 somewhat punctate appearance. When treated for 16 hours with 5 mM LiCI, a fraction of 
these cells display GFP-bright nuclei, consistent with the distribution of GFP 
complementation expected from this pair of fusion constructs (Figure 19 (B)). 

These results indicate that the GFP complementation method employing fusions to 
NtermE[F64L]YFP172and CtermEYFP173 of protein components that normally reside in 
25 separated cellular compartments or locations, can be used to monitor those translocation 
processes that result in both components coming together within one compartment or at 
one shared location. A simple intensity or image-based fluorescence assay based on this 
principle may be used to screen for compounds that affect the translocation processes 
relevant to specific cellular components in living mammalian cells. 
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Example 15: Exploitation of the inducible FRBP12.FRB interaction 

As illustrated in Example 14, some assays will experience a sink effect: the problem is 
that the natural shuttling of any naturally translocating protein is exposing the anchored 
zipper construct to a constant supply of binding partner resulting in irreversible binding 
5 and premature fluorophore formation. The same problem can be encountered in TIDC 
systems that depend on a constitutive natural interaction like the interaction between 
beta-catenin and TCF-4 (example 14). 

The FKBP12:FRB system, is a convenient way of associating e.g. the nuclear fraction of a 
translocating protein fused to a complementation partner A to an anchor fused to 
10 complementation partner B at any given time. 

The fusions that appear most appropriate based on the co-crystal structure of 
FKBP12:rapamycin:FRB (Figure 17) is to fuse the both split EYFP F64L fragments to the 
C-termini of FKBP12 and FRB because they are close to each other. However, as we 
generally favor fusing the N-terminal split fragment to the N-terminus of the fusion partner 
15 and visa versa, this orientation was chosen in Example 16. This is in part due to our 
experience that distance of at least 40A between the termini does not limit the 
complementation (GFP is about 40A long). Using FKBP12 and FRB we will need 10-15 
residue linkers on both sides of FKBP12 and FRB. The linker length will be between 1.5A 
(oc-helix) and 3.5A (p-sheet) per residue (see Figure 17). 



20 Example 16: Protein translocation assay with inducible interaction partners 

This example describes the construction of a protein translocation assay using an 
inducible GFP complementation method that demonstrates utility of the method in 
screening in live cells for compounds that modify the translocation behaviour of a target 
protein (Translocation Dependent Complementation with Rapamycin, TDCrap). 



25 This example demonstrates that an inducible GFP complementation systems can be 
designed to report specifically on the translocation behaviour of a target protein within 
living cells with an improved signal to background ratio. Following induction of the 
translocation process, complementation is stimulated by bringing together the interacting 
proteins and a fluorescence signal is obtained that corresponds directly with the degree of 

30 translocation that occurs. Such an assay system can be used to screen for compounds or 
treatments that will modify the translocation behavior of the target protein. 
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An additional linker sequence has been added in each because we no longer have the 
tightly associated zipper peptides involved. The Gateway recombination sequences are 
about 10 residues which. should be enough between the inserted fusion proteins and 
FKBP12/FRB. The connections to the split fragments should be 12-20 residues. 



5 p-catenin is a multifunctional protein that plays a key role in the development and 

progression of certain human cancers, most notably cancer of the colon. In a non-dividing 
colonic epithelial cell p-catenin is found in complexes with cell adhesion molecules at the 
plasma membrane, and also in complex with a number of proteins (Axin, Adenomatous 
polyposis coli protein, or APC, and the protein kinase GSK3p) that together control its . 

10 targeted destruction by the cellular ubiquitination system, p-catenin also functions as an 
activator of transcription, for which function it needs to translocate to the nucleus and 
dimerise with other pro-transcriptional co-factors, the most significant of which in colonic 
cells is TCF4. The level of p-catenin in non-dividing cells is kept low by ubiquitination and 
subsequent destruction by proteosomes, and little or no p-catenin is found in the nucleus, 

15 and thus little or no p-catenin-induced gene transcription occurs. In cancerous and certain 
pre-cancerous colon epithelial ceils, mutations in either p-catenin itself or in one of its 
regulatory proteins lead to accumulation of the protein in the cytoplasm, with subsequent 
translocation to the nucleus and constitutive activation of transcription from genes 
controlled by p-catenin:TCF4 factors. It is continuous expression of these genes that 

20 leads to the cancerous phenotype in such cells. 

In this example, complementary halves of GFP are fused separately to p-catenin and to 
histone H2B and co-expressed within cells. The p-catenin construct is expected to be 
localised entirely in the cell cytoplasm at a low level, its concentration effectively 
controlled by the activity of the protein kinase GSK3p. The H2B construct will be localised 

25 in the nuclei of cells by the highly basic histone rnoeity. Hence, the two components 
should not meet one another under normal growth conditions. However, when GSK3P is 
inhibited by lithium ions, the cytoplasmic level of the p-catenin construct is expected to 
increase in the cytoplasm, and to translocate in significant fashion to the nucleus. Both 
constructs have also been engineered to contain inducible interaction domains. The p- 

30 catenin construct includes FK506 binding protein (FKBP). The H2B construct includes 
FRB, which is the 93 amino acid portion of FRAP, the large PI3Kinase homolog FRAP 
(also known as mTOR or RAFT). The immunosuppressive compound rapamycin binds to 
FKBP and simultaneously to FRB, and thus serves as an heterodimeriser compound for 
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these two proteins. The complementation reaction between the separate halves of GFP 
that are included in these two constructs can therefore be induced by addition of 
rapamycin, but only if the protein constructs are able to meet each other at the same 
cellular location. Because FKBP and FRB do not interact in the absense of rapamycin, the 
5 "sink effect" is greatly reduced or eliminated. 

When the constructs find themselves mixed together in the nucleus as a result of p- 
catenin translocation, addition of rapamycin brings the complementary GFP halves into 
close apposition via the inducible FKBP:FRB interaction, whereupon the annealing 
process should be initiated that will ultimately lead to development of fully fluorescent 
10 GFP molecules. In this example, the development of nuclear GFP fluorescence that 

results after complementation is induced by rapamycin therefore reports on the degree of 
translocation into the nucleus of the yff-catenin construct at the time of rapamycin addition. 

The following fusion constructs were made (see Example 12): 
NtermE[F64L]YFP172-FKBP-beta-Catenin = plasmid code ps1809 
15 H2B-FRAP(2024-2114)-CtermEYFP173 = ps1827. 

Plasmids ps1809 with ps1827 were co-transfected into Chinese hamster ovary cells 
(CHO) using the transfection agent FuGENE™ 6 (Boehringer Mannheim Corp, USA) 
according to the method recommended by the suppliers. Cells were cultured in growth 
medium (HAM's F12 nutrient mix with Glutamax-1 , 10 % foetal bovine serum (FBS), 100 
lag penicillin-streptomycin mixture ml" 1 (GibcoBRL, supplied by Life Technologies, 
Denmark)). Transfected cells were cultured in this medium, with the addition of two 
selection agents appropriate to the plasmids being used, being 1 mg/ml zeocin plus 0.5 
mg/mlG41 8 sulphate. Cells were cultured at 37°C in 100% humidity and conditions of 
normal atmospheric gases supplemented with 5% C0 2 . 

After 10 to 12 days culture in the continuous presence of the selection agents, the 
resulting cell lines were judged to be stably transfected. For fluorescence microscopy, 
aliquots of cells were transferred to Lab-Tek chambered cover glasses (Nalge Nunc 
International, Naperville USA) and allowed to adhere for at least 24 hours to reach about 
80% confluence. All treatments were carried out in full HAM's medium + 10% FBS. 
Images were collected using a Zeiss LSM410 inverted confocal fluorescence microscope 
(Zeiss, Jena, Germany) using x40 (oil immersion) objective. 488 nm laser light was used 
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for excitation of GFP, emissions filtered via a FT510 dichroic and 510-525 nm bandpass 
filter. 

After treatments, but prior to microscopy, cells were routinely fixed with 4% formaldehyde 
in phosphate buffered saline (PBS) for 10 minutes, followed by 3 wash steps using PBS. 

5 Figure 20 shows a montage of CHO cells expressing ps1809 with ps1827, imaged from 4 
separate treatments. The top row (Figure 20 a-d) are images of GFP fluorescence. The 
GFP is localised to nuclei only of these cells, as demonstrated in the lower row of images 
(Figure 20 e-h) where transmitted-light images have been overlayed with their 
corresponding GFP fluorescence. In Figure 20 c & d, the cells were treated for 16 hours 

10 with 5 mM lithium chloride to inhibit the kinase GSK3|3, directly after transfer to the Lab- 
Tek chambered cover glasses; cells in Figure 20 a & b received no lithium during this 
time. Cells in Figure 20 b & d were then treated with 3 \xM rapamycin (in ethanol stock, 
final concentration of ethanol was 0.6% v/v), whilst those in Figure 20 a & c received only 
0.6% ethanol. Both treatments were continued for a further 6 hours before the cells were 

15 fixed and imaged. 

Without lithium treatment (Figure 20 a & b) the level of fluorescence is very weak in most 
cells and may be due entirely to cellular autofluorescence from flavanoids such as FAD 
(the emision filter used typically allows a fraction of FAD autofluorescence to pass); 
Occasional cells show a weak GFP fluorescence in nuclei, apparently the result of 

20 spontaneous complementation that may accompany nuclear membrane breakdown at cell 
division. When treated for 1 6 hours with 5 mM LiCI, the number of cells containing 
fluorescent GFP increases greatly, and many of these have the GFP fluorescence 
confined exclusively to their nuclei consistent with the distribution of GFP 
complementation expected from this pair of fusion constructs (Figure 20 c & d). The level 

25 of nuclear fluorescence in Li-treated cells increases substantially with rapamycin 
treatment (Figure 20 d), the dimeriser increasing both the number of cells with bright 
nuclei, and the average intensity of GFP in each responding cell. 



In a screening context, the present example show that in the untreated cells (a) no 
spontaneous complementation takes place (compared to Figure 19 (A)), thereby avoiding 
30 the "sink" effect). 
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These results indicate that the GFP complementation method can be configured to 
monitor the translocation of a protein component within living cells, and that the efficiency 
of the method, as measured by signal to background ratio, can be significantly improved 
by inclusion of an inducible protein-protein interaction linkage such as provided by 
5 FKBP:FRB in the presence of rapamycin. Such a configuration of the system could be 
used to construct high throughput intensity-based fluorescence assays to identify 
compounds (or other influences) that act to modulate translocations of cellular 
components. 
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Figure legends 

Figure 1 

General structures of the fusion protein coding sequences. 
Figure 2 

5 16 bit images of fluorescent CHO-hIR cells co-transfected with NtermEGFP-NZ and CZ- 
CtermEGFP expression vectors or transfected with pEGFP-C1 were taken and scaled 
individually to visualise the cells and the fluorescence distribution within them. Because of 
the pixel intensity scaling, the relative fluorescence levels cannot be compared among the 
images. The splitting sites are either between residues 157/158 (top row, plasmids 
10 PS 1557 and PS 1559) or between residues 172/173 (middle row, plasmids PS1558 and 
PS1560). The EGFP expression vector pEGFP-C1 was transfected into the cells in the 
bottom row. The images were taken 1 day (left column), 2 days (middle column), or 10 
days (right column) after transfection. The images of the cells are representative of the 
cells that expressed functionally complementing fragments. 

15 Figure 3 

The same 16 bit images of fluorescent CHO-hIR cells co-transfected with NtermEGFP-NZ 
and CZ-CtermEGFP expression vectors or transfected with pEGFP-C1 as shown in 
Figure 2 but the images are now shown with the same intensity scaling to allow 
comparison of fluorescence intensities. The cells that are transfected with 
20 complementation constructs that are based on a split between residues 172 and 173 
(middle row) are clearly more fluorescent than the cells that are transfected with 
complementation constructs that are based on a split between residues 157 and 158 (top 
row). However, the cells transfected with the pEGFP-C1 construct (bottom row) show 
significantly stronger fluorescence at day 2. 

25 Figure 4 

The unmanipulated microscope images shown in Figure 3 were analysed using the 
ImageJ software package and data analysis was performed in Microsoft Excel. For each 
16-bit monochrome IP Lab microscope image, pixel intensity data were produced in 
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ImageJ and exported to an Excel spread-sheet for data analysis. The darkest and 
brightest 0.5% of the pixels were identified in each image and the average intensities of 
these two groups of pixels were calculated. The average intensity of the 0.5% darkest 
pixels was defined as the back ground fluorescence intensity (shown as white bars in the 
5 histogram) and the intensity of the 0.5% brightest pixels was defined as the maximum 
intensity.. The difference in intensity between the maximum intensity and the background 
intensity was defined as the response (shown as cross hatched bars in the histogram). 
The sum of the background intensity and the response is equal to the maximum intensity. 
From the figure, it is clear that EGFP based fluorescence complementation using a split 
10 between residues 172 and 173, and probably anywhere else in this loop, is greatly 
superior to EGFP based fluorescence complementation using a split between residues 
157 and 158 and probably also to splits anywhere else in this loop. 

Figure 5 

Positions of appropriate fluorescent protein splitting sites are shown on ribbon and wire 
15 frame representations of GFP. The two representations show the same sites from two 
sides (molecule rotated approximately 180 degrees around a vertical axis). 

Figure 6 

Co-transfection of expression vectors expressing EGFP and EYFP based 
complementation fragments as described in Figure 3 to compare the abilities of the 
20 various complementation fragments to combine in cells and produce functional 

complexes. All images are scaled identically to allow direct comparison of fluorescence 
intensities between the images. 



25 



Single transfections with N-terminal fragments only resulted in no detectable fluorescence 
above the background level (data not shown). These N-terminal fragments contain amino 
acid residues 65-67 forming the chromophore in full-length GFP. 
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Figure 7 

Quantitative analysis of the images shown in Figure 6. The results are in accord with the 
impressions from visual inspection of the cells. The data were produced as described in 
the legend to Figure 4. 



5 Figure 8 

Co-transfection of expression vectors expressing EGFP and EYFP based 
complementation fragments as described in Figure 3 to compare the effects of mixing 
differently colored EGFP, EYFP and EYFP F64L fragments and to determine the 
influence of overlapping fragments, e.g. combining fragments encoding residues 1-172 

10 and 158-238. All color combinations complement but typically less efficiently than in the 
correct combinations, i.e. when no or few residues overlap. Fragments having overlapping 
regions are also functional and this may be advantageous in experiments where longer 
linker sequences are or may be required by the fusion partners due to steric hindrance. 
This was not the case in this experiments where the fusion partners are leucine zippers. 

15 In the example (middle column), residues 158-172 were present in both fragments. In all 
situations, the F64L has a favorable effect on the fluorescence intensities. All images are 
scaled identically to allow direct comparison of fluorescence intensities between the 
images. 

Figure 9 

20 Quantitative analysis of the images shown in Figure 8. The results can be compared 
directly with the results shown in Figure 7 and they are in accord with the impressions 
from visual inspection of the cells. The data were produced as described in the legend to 
Figure 4. 



Figure 10 

25 CHO-hIR [PS1767 + PS1768] cells at 3 time points after treatment with 1 jaM rapamycin. 
Note that image (c) was taken at 25 msec exposure, the previous 2 images at exposures 
of 100 msec each. 
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(a) is the starting condition for these cells (t=0), and fluorescence is barely visible in most 
cells, although it was noted that some ceils (< 5%) in the population had low, but 
appreciable, fluorescence before treatment. 

(b) , after 4 hours many cells (approximately 40%) had developed significantly greater 
5 EYFP fluorescence throughout the cytoplasmic and nuclear locations. 

(c) after 16 hours the response per cell had increased further and encompassed a larger 
proportion of the cell population (approximately 70%). 

Figure 11 

The rate of development of cellular EYFP fluorescence following rapamycin treatment of 
10 the CHO-hIR [PS1767 + PS1768] line. Cells were treated in 96-well plates with 3 (xM 
rapamycin and the fluorescence measured. Treatment and measurements were made 
with the cells growing in HAM's medium + 10% FBS, and fluorescence measurements 
were corrected for the background fluorescence from this medium. The graph 
. demonstrates that the half-time for development of fluorescence is approximately 5 hours. 
15 Values corrected for HAM's background, each value a mean + sd for 8 measurements. 

Figure 12 

Response curve to different rapamycin doses for the CHO-hIR [PS1769 + PS1771] cell 
line. Cells were cultured in 96-well plates, treated with various rapamycin doses for 16 
hours, then fixed and stained with Hoechst prior to determination of EYFP 
20 fluorescence/cell (arbitrary units) on the Ascent plate reader. Values are corrected for 
PBS background as well as cell number. The cell line shows approximately a 3-fold 
increase in the EYFP intensity/cell over the dose range of rapamycin used in this 
experiment. 



Figure 13 

25 Each of the cell lines were fluorescence activated cell sorted (FACS) into 3 groups: (i) 
most green group (ii) medium to low-green group and (iii) black group. The 'most green' 
was discarded in each case, while the other 2 groups were cultured for further use. 

A: CHO-hIR [ps1768 + ps1767] FACS group 'Black ? before stimulation (i), and after 16 
hours stimulation with 100 nM rapamycin (ii) & (iii). images (i) and (ii) were exposed for 
30 100 msec, image (iii) for 25 msec. 
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B: CHO-hIR [ps1768 + ps1767] FACS group 'medium-low green' before stimulation (i), 
and after 16 hours stimulation with 100 nM rapamycin (ii) & (iii). Images (i) and (ii) were 
exposed for 100 msec, image (iii) for 25 msec. 



Figure 14 

5 Show the response of the 'medium to low-green' (a) and 'black' (b) FACS groups 

(respectively) derived from the CHO-hIR [PS1767 + PS1768] parent line (see Figure 13). 
Dose response to rapamycin was measured after 7 hours (i) and 30 hours (ii) for each cell 
line. Values for fluorescence have been corrected for plate & medium background. 
Increase in EYFP fluorescence is better than 20-fold the unstimulated value in each case. 

10 Figure 15 

Show dose-response competition curves for FK506 versus 100 nM rapamycin in two of 
the FACS'd lines, (a) CHO-hIR [PS1768 + PS1767] 'mid to low-green' group, and (b) 
CHO-hIR [PS1769 + PS1771] 'black' group. EC 50 values in both cases are approximately 
1.2 pM FK506. The cells were incubated overnight (16 hours) with mixtures of the two 
15 compounds, then fixed and stained with Hoechst prior to detemination of EYFP 

fluorescence/cell on an Ascent plate reader. Plate and solution backgrounds have been 
subtracted; the dashed lines on each graph indicate the prestimulated fluorescence levels 
for each cell line in these experiments. 

Figure 16 

20 Alignment of fluorescent proteins. 

Figure 17 

Crystal structure of rapamycin induced binding of FRBP12 (left) bound to FRB (right). The 
C-termini of the molecules are within 15A of each other. The N-terminus of one of the 
molecules is 32-36A away from the C-terminus of the other molecule. 
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Figure 18 

(A) shows the level of fluorescence in stable CHO[ps1706 + ps1810] 
(NtermE[F64L]YFP172-p-catenin + H2B-TCF4(frag)- CtermEYFP173) prior to any. 
treatment of the cells. 

The level of fluorescence is very weak in most cells and may be due entirely to cellular 
autofluorescence from flavanoids such as FAD (the emision filter used typically allows a 
fraction of FAD autofluorescence to pass). Occasional cells (top right corner of (A)) show 
a weak GFP fluorescence in nuclear and cytoplasmic locations, apparently the result of 
spontaneous complementation. Most of these cells are found clustered, possibly 
indicating a clonal behaviour that may reflect incorrect processing of the introduced 
components, such that they are brought together into the same location during expression 
(or possibly cell division) where complementation can occur. However, when treated for 
16 hours with 5 mM LiCI, the number of cells containing fluorescent GFP increases 
greatly, and many of these have the GFP fluorescence confined exclusively to their nuclei 
consistent with the distribution of GFP complementation expected from this pair of fusion 
constructs (B). 

Figure 19 

(A) shows the level of fluorescence in stable CHO[ps1736 + ps1758] 
(NtermE[F64L]YFP172-Zip-H2B + p-catenin-Zip-CtermEYFP173 prior to any treatment of 
20 the cells. When treated for 16 hours with 5 mM LiCI, a fraction of these cells display GFP- 
bright nuclei, consistent with the distribution of GFP complementation expected from this 
pair of fusion constructs ((B)) 

Figure 20 

This figure shows a montage of CHO cells expressing ps1809 with ps1827 
25 (NtermE[F64L]YFP172-FKBP-beta-Catenin + H2B-FRAP(2024-21 14)- CtermEYFP173), 
imaged from 4 separate treatments. The top row (a-d) are images of GFP fluorescence. 
The lower row of images (e-h) where transmitted-light images have been overlayed with 
their corresponding GFP fluorescence. 

a&e: No treatment. 



10 



15 



30 b&f: 3 jxM rapamycin (+6 hours) 
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c&g:16 hours with 5 mM lithium chloride 

d&h:16 hours with 5 mM lithium chloride 3 fiM rapamycin (+6 hours) 



Figure 21 

Figure showing the various assay configurations. 

5 IDC (Interaction Dependent Complementation) is the regular configuration where 

interaction between two proteins (here illustrated as BetaCat and Tcf4) will cause the N- 
and C- terminal XFP complementation partners to reassembly and be detectable. One 
particular advantage of IDC pairs is to validate TIDC assays developed from chosen 
protein pairs. The rapamycin inducible FKBP-FRB interaction pair can be used to validate 
10 and further develop the utility of IDC. See also the description of the TDCrap system. 

TIDC (Translocation and Interaction Dependent Complementation) is the configuration 
where redistribution and interaction is measured as an increase in light intensity. Beta- 
catenin fused to a complementation fragment is predominantly located in the cytoplasm. 
Upon stimulation, beta-catenin will translocate to the nucleus. Here, it will find the natural 
1 5 interaction partner TCF-4 fused to the complementing fragment. The two 

complementation partners will assemble and become fluorescent. Compounds that inhibit 
either beta-catenin translocation or beta-catenin-TCF-4 interaction will inhibit 
complementation and formation of the fluorescent complex. 



TDC (Translocation Dependent Complementation) is the configuration where only 
20 redistribution is measured as an increase in light intensity. For example, beta-catenin 
fused to a XFP complementation fragment and a leucine zipper sequence is 
predominantly located in the cytoplasm. Upon stimulation, beta-catenin fusion will 
translocate to the nucleus. Here, the leucine zipper will find the other leucine zipper 
sequence which is fused to the complementing XFP fragment and an anchor protein or 
25 nuclear localisation signal. The two complementation partners will assemble because of 
the zipper interactions and become fluorescent. Compounds that inhibit either beta- 
catenin translocation will inhibt complementation and formation of the fluorescent 
complex. 



30 



TDCrap (Translocation Dependent Complementation induced by rapamycin) is basically 
the same as TDC but the use of an inducible interaction pair, e.g. the rapamycin inducible 
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FKBP and FRB interaction pair, greatly reduces the "sink effect" and facilitates the 
development of assays based on other types of translocation, e.g. from the cytoplasm to 
plasma membrane or visa versa. 



Figure 22 

5 The left hand part of this figure shows the assay configuration for determining dimerization 
(in this case homo-dimerization) of receptors. Two conjugates comprising the receptor 
and the N-terminus al of the complementation protein (here YN) or the C-terminus al of 
the complementation protein (here YC) are expressed in the cell. Only upon stimulation of 
the cell with a ligand, the receptors are brought in close apposition and the YN and YC 
10 terminal fragments will from a functional protein. 

The right hand side of the figure illustrates the configuration of an assay for caspase 
activity. The first conjugate comprises the N-terminus al of the complementation protein 
(YN), a constitutive interaction partner (a leucine zipper), a nuclear localisation sequence 
(NLS), a consensus cleavage site for caspases (e.g. VAD = Valine-Alanine-Aspartate) 

15 fused to an anchor protein, here in the cell membrane. The second conjugate comprises 
the N C-terminus al of the complementation protein (YC), the interaction partner (a leucine 
zipper) and an anchor in the nucleus (here marked as a circle). When caspase activity is 
increased, the caspases will cleave the YN fusion at the VAD site and the first conjugate 
will, due to the NLS, translocate to the nudes, where binding to the interaction partners 

20 (zippers), will cause fluorescence. 
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Table 2 Oligo nucleotides used in cloning. Oligo nucleotides beginning with P* are 



phosphorylated at the 5 f end to permit ligation. 

Oligo Oligo nucleotide sequence (5' end to 3' end) SEQ 

nucleo ID 

-tide NO: 

1268 ATTB2- CCTACTGCTTTGAGATTCGTCGG 15 

1272 ATTB2- GTCATTCCAGTTTTAGAAGCTC 16 

1 282 CAGACAATCTGTGTGGGCACTCGACCGG 1 7 

2110 P * CATGGCCGGTGGTACCGGTTCCGGTGCCCTGAAGAAGGAGCTGCAGG 18 

2111 P*AGCTCCTTCTTCAGGGCACCGGAACCGGTACCACCGGC 19 

2112 P * CCAACAAGAAGGAGCTGGCCCAGCTGAAGTGGGAGCTGCAG 2 0 

2113 P*CTCCCACTTCAGCTGGGCCAGCTCCTTCTTGTTGGCCTGC 21 

2114 ' P *GCCCTGAAGAAGGAGCTGGCCCAGTAG 22 

2115 P * GATCCTACTGGGCCAGCTCCTTCTTCAGGGCCTGCAG 2 3 

2116 P * C ATGGCCAGCGAGCAGCTGGAGAAGAAGCTGCAGGCCCTG 2 4 

2117 P * CCTGCAGCTTCTTCTCCAGCTGCTCGCTGGC 2 5 

2118 P * GAGAAGAAGCTGGCCCAGCTGGAGTGGAAGAACCAGGCCCTGGAG 2 6 

2119 P * GGCCTGGTTCTTCCACTCCAGCTGGGCCAGCTTCTTCTCCAGGG 2 7 

2120 P * AAGAAGCTGGCCCAGGGCGGCACCGGTTAG 2 8 

2121 P * GATCCTAACCGGTGCCGCCCTGGGCCAGCTTCTTCTCC AG 2 9 

2128 GGCGCCATGGTGAGCAAGGGCGAG 30 

2129 GCCGGACCGGTACCACCGTTGTACTCCAGCTTGTG 3 1 

2130 GCCGGACCGGTACC ACCCTGCTTGTCGGCC ATG 3 2 

2131 GCCGGACCGGTACCACCCTCGATGTTGTGGCGGATC 3 3 

2132 CCCCGGATCCTACTTGTACAGCTCGTCCATGC 34 

21 33 GGCGCCATGGGCACCGGTTACAACAGCCACAACGTC 3 5 

2134 GGCGCCATGGGC ACCGGTAAGAACGGCATCAAGGTG 3 6 

21 35 GGCGCCATGGGCACCGGTGACGGCAGCGTGCAGCTC 3 7 

22 1 9 GGGGGCTAGCGCCACCATGGTGAGCAAGGGCGAG 3 8 

2222 GCGGGGGATCCGATATCGCCAGAGCCAGAGCCAGAGCCCTCGATGTTGTGGCGGATC 3 9 

2225 GGGGGCTAGCGATATCCGGCTCTGGCTCTGGCTCTGGCGACGGCAGCGTGCAGCTC 4 0 

2333 GCCCACCCTCGTGACCACCTTCGGCTACGGCCTGCAGTGCTTCGCCCGCTACCCCGACC 4 1 
ACATG 



2334 CATGTGGTCGGGGTAGCGGGCGAAGCACTGCAGGCCGTAGCCGAAGGTGGTCACGAGGG 4 2 
TGGGC 

2335 GCCCACCCTCGTGACCACCCTGGGCTACGGCCTGCAGTGCTTCGCCCGCTACCCCGACC 4 3 
ACATG 

2336 CATGTGGTCGGGGTAGCGGGCGAAGCACTGCAGGCCGTAGCCCAGGGTGGTCACGAGGG 4 4 
TGGGC 
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Oligo Oligo nucleotide sequence (5 1 end to 3' end) SEQ 

nucleo ID 

-tide NO: 

2337 GACAACCACTACCTGAGCTACCAGTCCGCCCTGAGC 4 5 

2338 GCTCAGGGCGGACTGGTAGCTCAGGTAGTGGTTGTC 4 6 

2442 ATTB1- CCAC CATGGGAGTGCAGGTGGAAACC 4 7 

2443 ATTB2- CTTCCAGTTTTAGAAGCTC 4 8 

2444 ATTB1- CCACCATGGAGATGTGGCATGAAGGCCTG 4 9 

2445 , ATTB2- CCTGCTTTGAGATTCGTCGGAACAC 5 0 

9655 TCCTAGGTCAGTCCTGCTCCTCGGCCACGAAGTGCAC 5 1 
TCCTAGGCTGCAGCACGTGTTGACAATTAATCATCGG 

9658 CAGACAATCTGTGTGGGCACTCGACCGG 52 

2387 GCGGGGGATCCGATATCGCCAGAGCCAGAGCCAGAGCCctgggccagctccttctt cag 5 3 
2368 attB2- cctacttggagctggtgtacttggtgac 54 

2388 GGGGGCTAGCGATATCCGGCTCTGGCTCTGGCTCTGGCGACGGCAGCGTGCAGCI'C 55 

2359 attBl- ccaccatggctactcaagctgatttgatg 56 
2439 attB2- ccaggtcagtatcaaaccaggccag 57 

2360 attB2- cctacaggtcagtatcaaaccaggccag 58 

2389 gttgctagcaccatgcctgagccagccaagtctgct 59 

2390 gttgatatcccttggagctggtgtacttggtgac 60 
2364 attBlq- ccaccatgccgcagctgaacggcggtgga 61 

2366 attB2- caggcggaggccgtctttccgc 62 

2462 gttcagctgatgggagtgcaggtggaaaccatctcc 63 

2463 gttggatccgatatcaccggtgcctgtcccagttccttccagttttagaagctccacat 64 
c 

2464 gttgctagcgatatccggaacgggcactgggaccggtgagatgtggcatgaaggcctgg 65 
aag 

2465 gttcccgggagctgctttgagattcgtcggaacac 66 

2367 attBl- ccaccatgcctgagccagccaagtctgct 67 
2407 attB2- ccttggagctggtgtacttggtgac 68 
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Table 3 Primer pairs used in EGFP fragment amplification 



Protein encoded by PCR fragment 


5' primer 


3' primer 


EGFP(1-144) 


2128 


2129 


EGFP(1-157) 


2128 


2130 


EGFP(1-172) 


2128 


2131 


EGFP(1 45-238) 


2133 


2132 


EGFP(1 58-238) 


2134 


2132 


EGFP(1 73-238) 


2135 


2132 
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Table 4 Cloning and expression vectors 



Vector Expressed protein Promoter Selection 



E.coli/mamm. 


pcurr v./ I 


torr 


CMV 


kan/neo 




prspp 


CMV 


zeo/zeo 


nTrp I— I i o A 

p i 1 onio~M 


no insen 


Trc 


amp/none 


PO 1 O 1 O 


NZ leucine zipper 


Trc 


amp/none 


r'O 1 O 1 D 


CZ leucine zipper 


Trc 


amp/none 


rOlDl4 


Ntermtorr144-NZ 


Trc 


amp/none 


PQi KQft 

ro i oyo 


NteriTlfciorPI D/-NZ 


Trc 


amp/none 


rol oy / 


NtermbUrrl 72-NZ 


Trc 


amp/none 


rOlDl O 


oz-cterm to r P 145 


Trc 


amp/none 


PO i oy^f 


cz-ctermtijpp loo 


Trc 


amp/none 




oz-v>term izo r p i 


Trc 


amp/none 


po 1 ooy 


NtermtorPi 57-nz 


CMV 


kan/neo 


roioou 


NtermbCjhPi 7z-NZ 


CMV 


kan/neo 


PO I OD( 


oz-LfterrntorPi oo 


CMV 


zeo/zeo 


PQi CCQ 
r O 1 OOO 


oz.-oiermtzorp i /o 


CMV 


zeo/zeo 


DQ «1 COO 

Pol boy 


NtermEYF P1 57-NZ 


CMV 


kan/neo 


PS 1642 


NtermEYFPI 72-NZ 


CMV 


kan/neo 


PS 1640 


(NtermE[F64L]1 57YFP-NZ 


CMV 


kan/neo 


PS1641 


NtermE[F64L]YFP172-NZ 


CMV 


kan/neo 


PS1637 


CZ-Cterm EYFP 158 


CMV 


zeo/zeo 


PS 1638 


CZ-CtermEYFP173 


CMV 


zeo/zeo 


PS1769 


NtermE[F64L]YFP172-FKBP 


CMV 


kan/neo 


PS1767 


N term E[F64L] YF P 1 72- F RB 


CMV 


kan/neo 


PS1771 


FRB-CtermEYPF1 73 


CMV 


zeo/zeo 


PS 1768 


FKBP-CtermEYFP173 


CMV 


zeo/zeo 



Table 5 Sequence names and numbers 



SEQ ID NO: Name 



1 


Amino acid sequence of GFP 


2 


Amino acid sequence of GFP Y66W 


3 


Amino acid sequence of GFP Y66H 


4 


Amino acid sequence of EGFP 


5 


Amino acid sequence of EYFP 


6 


Amino acid sequence of EYFP F64L variant 
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7 Nucleic acid sequence of NZ 

8 Amino acid sequence of NZ 

9 Nucleic acid sequence of CZ 

1 0 Amino acid sequence of CZ 

1 1 NtermE[F64L]YFP1 72 and FKBP linker sequence 

12 Nterm E[F64 L] YFP 1 72 and FRB linker sequence 

13 FRB and CtermEYFP173 linker sequence 

14 FKBP and CtermEYFP173 linker sequence 
1 5-68 Primer sequence (see Table 2) 

89 beta-Catenin and CZ-CtermEYFP173 linker sequence 

70 Nterm E[F64L]YFP1 72 and beta-Catenin linker sequence 

71 TCF4(1-70) and CtermEYFP173 linker sequence 

72 Nterm E[F64L] YFP 1 72 and FKBP and beta-Catenin . 

73. H2B and FRAP (amino acids 2025-21 14) linker sequence 

74 Nterm E[F64L] YFP 1 72 -NZ and H2B linker sequence 

75 H2B and TCF4(1-70) linker sequence 

76 FKBP and beta-Catenin linker sequence 



All cited patens, publications, copending applications, and provisional applications 
referred to in this application are herein incorporated by reference. 

The invention being thus described, it will be obvious that the same may be varied in 
5 many ways. Such avariations are not to be regarded as a departure from the spirit and 
scope of the present inventions, and all such modifications as would be obvious to one 
skilled in the art are intended to be included within the scope of the following claims. 
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Claims 

1 . A cell comprising at least: 

- a first conjugate comprising a first protein and the N-terminal fragment of 
5 complementation protein; and 

- a second conjugate comprising a second protein and the C-terminal fragment of the 

complementation protein; 

wherein said first conjugate has a predominant location in a different cellular location from 
where said second conjugate is predominantly located, 
10 wherein said first protein and said second protein will bind to each other, and 

wherein said complementation protein exhibits altered characteristics when the two 
fragments of the complementation protein are brought into close apposition and the 
two fragments of the complementation protein form the full functional protein. 

1 5 2. The cell according to any claim 1 , wherein the cellular compartment of the first 
conjugate is determined by the nature of the first protein. 



3. The cell according to any of the preceding claims, wherein the cellular compartment of 
the second conjugate is determined by the nature of the second protein. 

4. The cell according to any of claims 1 and 2, wherein the second conjugate further 
20 comprises an anchor protein, wherein said anchor protein is anchored in a different 

cellular compartment from where said first protein is predominantly located. 

5. A cell comprising at least: 

- a first conjugate comprising a first protein, interaction partner A and the first terminal 
fragment of a complementation protein; and 

25 - a second conjugate comprising a interaction partner B and the second terminal fragment 
of the complementation protein; 
wherein said first conjugate has a predominant location in a different cellular compartment 

from where said second conjugate is predominantly located, 
wherein said complementation protein exhibits altered characteristics when the two 
30 fragments of the complementation protein are brought into close apposition and the 
two fragments of the complementation protein form the full functional protein. 
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6. The cell according to claim 5, wherein said interaction partner A and interaction partner 
B bind to each other. 

7. The cell according to claim 5, wherein said interaction partner A and interaction partner 
5 B bind to each other only when an interaction stimulus has been applied. 

8. The cell according to any of claims 5-7, wherein the second conjugate further 
comprises an anchor protein, wherein said anchor protein is anchored in a different 
cellular compartment from where said first protein is predominantly located. 

9. The cell according to any of the preceding claims, wherein complementation protein is 
10 dihydrofolate reductase (DHFR). 

10. The cell according to any of claim 1-8, wherein the complementation protein is 
Ubiquitin. 

1 1 . The cell according to any of claim 1-8, wherein complementation protein is p- 
lactamase. 

15 12. The cell according to any of claim 1-8, wherein complementation protein is p- 
galactosidase. 

13. The cell according to any of claim 1-8, wherein complementation protein is a 
fluorescent protein. 

14. The ceil according to claim 13, wherein the complementation protein is Green 
20 Fluorescent Protein (GFP). 

15. The cell according to claim 14, wherein the complementation protein is derived from 
Aequoria victoria. 



16. The cell according to claim 14, wherein N-terminal fragment of the complementation 
25 protein is an N-terminal fragment of GFP, comprising a continuous stretch of amino acids 
from amino acid number 1 to amino acid number X of GFP, wherein the peptide bond 
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between amino acid number X and amino acid number X+1 is within a loop of GFP and 
C-terminal fragment of the complementation protein is a C-terminal fragment of GFP, 
comprising a continuous stretch of amino acids from amino acid number X+1 to amino 
acid number 238 of GFP. 



5 17. The cell according to claim 14, wherein the N-terminal fragment of the 

complementation protein is an N-terminal fragment of GFP, comprising a continuous 
stretch of amino acids from amino acid number 1 to amino acid number X of GFP, 
wherein the peptide bond between amino acid number X and amino acid number X+1 is 
within a loop of GFP and the C-terminal of the complementation protein is a C-terminal 
10 fragment of GFP, comprising a continuous stretch of amino acids from amino acid number 
Y+1 to amino acid number 238 of GFP, wherein Y<X creating an overlap of the two GFP 
fragments, and wherein the peptide bond between amino acid Y and amino acid Y+1 is 
within a loop of GFP. 

18. The cell according to any of claim 14-17, wherein GFP is selected from the group 
15 consisting of EGFP, EYFP, ECFP, dsRed and Renilla GFP. 

19. The cell according to claim 18, wherein the GFP is EGFP. 

20. The cell according to claim 18, wherein the GFP is EYFP. 



21. The cell according to any of claim 14-20, wherein the amino acid in position 1 
preceding the chromophore has been mutated to provide an increase of fluorescence 

20 intensity. 

22. The cell according to claim 21 , wherein the amino acid F in position 1 preceding the 
chromophore has been substituted by L. 

23. The cell according to any of claim 14-22, wherein the GFP has been mutated to 
further contain the S72A mutation. 



25 24. The cell according to any of claim 16-23, wherein X is between 9 and 10 within the 
Thr9-Val11 loop; or between 23 and 24 within the Asn23-His25 loop; or between 38 and 
39 within the Thr38-Gly40 loop; or between 48 and 55 within the Cys48-Pro56 loop; or 
between 72 and 75 within the Ser72-Asp76 loop; or between 81 and 82 within the His81- 
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Phe83 loop; or between 88 and 89 within the Met88-Glu90 loop; between 101 and 102 
within the Lys101-Asp103 loop; or between 1 14 and 117 within the Phe114-Thr118 loop; 
or between 128 and 144 within the lie 128-Tyr145 loop; or between 154 and 159 within 
the Ala154-Gly160 loop; or between 171 and 174 within the Ile171-Ser175 loop; or 
5 between 188 and 196 within the Ile188-Asp197 loop; or between 210 and 214 within the 
Asp210-Art215 loop. 

25. The cell according to claim 24, wherein X is between 154 and 159 within the Ala154- 
Gly160loop. 

26. The cell according to claim .25, wherein X is 157 within the Ala154-Gly160 loop. 

10 27. The cell according to claim 24, wherein X is between 171 and 174 within the Ile171- 
Ser175loop. 

28. The cell according to claim 27, wherein X is 172 within in Ile171-Ser175 loop. 

29. The cell according to claim 17, wherein Y is between 154 and 159 within the Ala154- 
Gly160 loop. 

15 30. The cell according to claim 29, wherein Y is 157 within the Ala154-Gly160 loop. 

31 . The cell according to any of claims 16-30, wherein X is 172 within in Ile171-Ser175 
loop and wherein Y is 157 within the Ala154-Gly160 loop. 

32. The cell according to any of the preceding claims, wherein the N-terminal fragment of 
20 the complementation protein is fused in frame with a first protein of interest. 

33. The cell according to any of the preceding claims, wherein the first protein is fused to 
the N-terminal of the N-terminal fragment of the complementation protein. 

34. The cell according to any of the preceding claims, wherein the first protein is fused to 
the C-terminal of the N-terminal fragment of the complementation protein. 



25 



35. The cell according to any of the preceding claims, wherein the C-terminal fragment of 
the complementation protein is fused in frame with the second protein. 
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36. The cell according to any of the preceding claims, wherein the second protein of 
interest is fused to the N-terminal of the C-terminal fragment of the complementation 
protein. 

37. The cell according to any of the preceding claims, wherein the second protein of 
5 interest is fused to the C-terminal of the C-terminal fragment of the complementation 

protein. 

38. The cell according to any of the preceding claims, wherein the N-terminal fragment of 
the complementation protein fused in frame to a first protein further comprises a linker 
sequence between the N-terminal fragment of the complementation protein and the first 

10 protein. 

39. The cell according to any of the preceding claims, wherein the C-terminal fragment of 
the complementation protein fused in frame to a second protein further comprises a linker 
sequence between the C-terminal fragment of the complementation protein and the 
second protein. 

15 40. The cell according to any of the preceding claims, wherein the GFP is EYFP further 
containing an F64L mutation, wherein X is 172, wherein the first protein fused to the N- 
terminal fragment of GFP is fused to the C-terminal of the N-terminal fragment of GFP 
and wherein the second protein fused to the C-terminal fragment of GFP is fused to the N- 
terminal of the C-terminal fragment of GFP. 

20 41 . The cell according to any of the preceding claims, wherein the GFP is EYFP further 
containing an F64L mutation, wherein X is 157, wherein the first protein fused to the N- 
terminal fragment of GFP is fused to the C-terminal of the N-terminal fragment of GFP 
and wherein the second protein fused to the C-terminal fragment of GFP is fused to the N- 
terminal of the C-terminal fragment of GFP. 

25 

42. The cell according to any of the preceding claims, wherein the first conjugate is 
expressed in the cell. 



43. The cell according to any of the preceding claims, wherein the second conjugate is 
expressed in the cell. 
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44. The cell according to any of the preceding claims, wherein interaction partner A is 
FKBP12, interaction partner B is FRAP (or vise versa) and the interaction stimulus is 
Rapamycin. 

5 45. The cell according to any of the preceding claims, wherein interaction partner A is 
FKBP12, interaction partner B is FRB (T2098L) (or vise versa) and the interaction 
stimulus is Rapamycin. 

46. The cell according to any of the preceding claims, wherein interaction partner A is 
FKBP12, interaction partner B is FRB (T2098L) (or vise versa) and the interaction 

10 stimulus is AP21 967. 

47. The cell according to any of the preceding claims, wherein interaction partner A is 
FKBP12, interaction partner B is FKBP12 (or vise versa) and the interaction stimulus is 
AP21967. 

48. The cell according to any of the preceding claims, wherein interaction partner A is 
15 FKBP12, interaction partner B is FKBP12 (or vise versa) and the interaction stimulus is 

AP21967. 

49. The cell according to any of the preceding claims, wherein interaction partner A and B 
are steroid hormone receptors and the dimerizing agent is the cognate hormone ligand. 

50. The cell according to any of the preceding claims, wherein interaction partner A and B 
20 are estrogen receptors and the interaction stimulus agent is e!strogen. 

51 . The cell according to any of the preceding claims, wherein the anchor protein is a 
protein containing the transmembrane domain of the epidermal growth factor receptor 
(EGFR), or containing the transmembrane domain of one of the integrin protein family, or 

25 containing the myristoylation sequence from c-Src (residues 1-14). 



52. The cell according to any of the preceding claims, wherein the anchor protein is a 
histone protein or a protein normally restricted to nucleoli, for example the p120 nucleolar 
protein. 
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53. The cell according to any of the preceding claims, wherein the anchor protein is a 
protein normally confined to mitochondrial outer or inner membranes for example VDAC, 
F 0 subunit of ATP-ase, or NADH dehydrogenase. 

54. The cell according to any of the preceding claims, wherein the anchor protein is a 
5 protein normally confined to the various different regions of Golgi bodies for example 

TGN38 or ADAM12-L. 

55. The cell according to any of the preceding claims, wherein the anchor protein is a 
protein normally confined to focal adhesion complexes for example P125, FAK, integerin 
alpha or beta, or paxillin. 

10 56. The cell according to any of the preceding claims, wherein the anchor protein is a 
protein normally associated with cytoskeletal structures such as F-actin strands or micro 
tubular bundles for example MAP4, actin binding domain of alpha-actinin, kinesins, 
myosins or dyniens. 

57. The cell according to any of the preceding claims, wherein the anchor protein is a 

15 protein normally associated with nuclear material or nuclear components, such as histone 
proteins, including histones H1, H2A, H2B, H3, H4, and variants thereof. 

58. The cell according to any of the preceding claims, wherein the anchor protein is a 
protein normally associated with nuclear membrane such as A and B type lamins, or 
associated with nuclear bodies such as splicing bodies, Cajal bodies, PML nuclear bodies 

20 (PML oncogenic domains, PODS), or transcription engines such as RNA polymerase 
POL-II. 

59. A cell according to any of the preceding claims, where in the cell is an eukaryotic cell. 

60. A cell according to any of the preceding claims, where in the cell is a prokaryotic cell. 

25 61. A method for detecting protein-protein interactions comprising the steps of: 
(a) providing a cell comprising at least: 

- a first conjugate comprising the first protein of interest and the first terminal fragment 
of a complementation protein; and 
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- a second conjugate comprising the second protein of interest and the second terminal 
fragment of a complementation protein; 

wherein said second protein of interest has a predominant location in a different 
cellular location from where said first protein of interest is predominantly located, 
5 (b) determine if complementation partner A and complementation partner B has 
complemented; 

complementation between complementation partner A and complementation partner B 
being indicative that the first protein of interest has translocated from the predominant 
location of this first protein of interest to the cellular compartment where the second 
10 protein of interest is located and that the two proteins of interest interact. 



62. A method for detecting protein-protein interactions comprising the steps of: 

(a) providing a cell according to any of claim 1 to 60, and 

(b) determine if complementation partner A and complementation partner B has 
complemented; 

15 complementation between complementation partner A and complementation partner B 
being indicative that the first protein of interest has translocated from the predominant 
location of this first protein of interest to the cellular compartment where the second 
protein of interest is located and that the two proteins of interest interact. 



63. A method for testing if a compound prevents a protein-protein interaction comprising 
20 the steps of: 

(a) providing a cell according to any of claim 1 to 60; 
(a1 ) adding the compound to the cells of step (a); 

(a2) adding a translocation stimulus to the cells of step (a1), 

wherein the translocation stimulus causes the two conjugates to have the same cellular 
25 localization; 

(b) determine if complementation partner A and complementation partner B has 
complemented; 

less complementation compared to a system omitting step (a1) being indicate that the 
compound is capable of preventing the interaction between the two proteins. 



30 64. A method according to any of claims 61-63, wherein the second conjugate further 
comprises an anchor protein, wherein said anchor protein is anchored in a different 
cellular compartment from where said first protein of interest is predominantly located. 
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65. A method for detecting translocation of a protein comprising the steps of: 

(a) providing a ceil comprising at least: 

- a first conjugate comprising said protein, the first terminal of the complementation 
5 protein, and interaction partner A; and 

- a second conjugate comprising an anchor protein, the second terminal of 
complementation protein, and interaction partner B; 

wherein said anchor protein is anchored in a different cellular compartment from where 
said protein is predominantly located; 
10 and wherein said interaction partner A and interaction partner B bind to each other; 

(b) determine if complementation partner A and complementation partner B has 
complemented; 

complementation between complementation partner A and complementation partner B 
being indicative that said protein has translocated from the predominant location of the 
1 5 protein to the cellular compartment where the anchor protein is anchored. 

66. A method for detecting translocation of a protein comprising the steps of: 

(a) providing a cell according to any of claim 6-60, and 

(b) determine if complementation partner A and complementation partner B has 
complemented; 

20 complementation between complementation partner A and complementation partner B 
being indicative that said protein has translocated from the predominant location of the 
protein to the cellular compartment where the anchor protein is anchored. 

67. A method for testing if a compound induces translocation of a protein comprising the 
steps of: 

25 (a) providing a cell according to any of claim 6-60; 
(a1) adding the compound to the cells of step (a); 

(b) determine if complementation partner A and complementation partner B has 
complemented; 

increased complementation compared to a system omitting step (a1) being indicate that 
30 the compound induces translocation of the protein. 



68. A method for testing if a compound prevents translocation of a protein comprising the 
steps of: 

(a) providing a cell according to any of claim 6-60; 
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(a1 ) adding the compound to the cells of step (a); 

(a2) adding a translocation stimulus to the cells of step (a1): 

(b) determine if complementation partner A and complementation partner B has 
complemented; 

5 less complementation compared to a system omitting step (a1) being indicative that the 
compounds prevents translocation of the protein. 

69. A method for detecting translocation of a protein comprising the steps of: 

(a) providing a cell comprising at least: 

10 - a first conjugate comprising said protein, complementation partner A, and interaction 
partner A; and 

- a second conjugate comprising an anchor protein, complementation partner B, and 
interaction partner B; 

wherein said anchor protein is anchored in a different cellular compartment from where 
15 said protein is predominantly located, 

wherein said interaction partner A and interaction partner B bind to each other only 
when an interaction stimulus has been applied; 

(b) add an interaction stimulus; 

(c) determine if complementation partner A and complementation partner B has 
20 complemented; 

complementation between complementation partner A and complementation partner B 
being indicative that said protein has translocated from the predominant location of the 
protein to the cellular compartment where the anchor protein is anchored after addition of 
the interaction stimulus. 

25 70. A method for detecting translocation of a protein comprising the steps of: 

(a) providing a cell according to any of claims 7-60, and 

(b) add an interaction stimulus; 

(c) determine if complementation partner A and complementation partner B has 
complemented; 

30 complementation between complementation partner A and complementation partner B 
being indicative that said protein has translocated from the predominant location of the 
protein to the cellular compartment where the anchor protein is anchored after addition of 
the interaction stimulus. 
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71 . A method for testing if a compound induces translocation of a protein comprising the 
steps of: 

(a) providing a cell according to any of claim 7-60; 

(a1) adding the compound to the cells of step (a); 

5 (b) add an interaction stimulus; 

(c) determine if complementation partner A and complementation partner B has 
complemented; 

increased complementation compared to a system omitting step (a1) being indicate that 
the compound induces translocation of the protein. 

10 72. A method for testing if a compound prevents translocation of a protein comprising the 
steps of: 

(a) providing a cell according to any of claim 7-60; 
(a1 ) adding the compound to the cells of step (a); 
(a2) adding a translocation stimulus to the cells of step (a1): 
15 (b) add an interaction stimulus; 

(c) determine if complementation partner A and complementation partner B has 
complemented; 

less complementation compared to a system omitting step (a1) being indicative that the 
compound prevents translocation of the protein. 

20 73. A kit for detecting antigens comprising: 

(a) a first antibody that binds to the antigen; 

(b) a first conjugate comprising a protein binding to the first antibody and the N-terminal of 
a complementation protein; 

(c) a second conjugate comprising a protein binding to the first antibody and the C- 
25 terminal of the complementation protein; 

whereby binding of the first and second conjugate to the first antibody will bring the two 
terminals of the complementation protein so close that a functional protein is formed. 



74. A method for determining caspase activity in a cell comprising at least the steps of 
(a) providing a cell comprising at least: 
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- a first conjugate comprising the first terminal of a complementation protein, interaction 

partner A, a nuclear localisation sequence (NLS), a Valine-Alanine-Aspartate (VAD) 
sequence and an anchor protein; 

- a second conjugate comprising the second terminal of a complementation protein, 
5 interaction partner B and an anchor protein; 

wherein the anchor protein of the first conjugate is in a different cellular location from the 

nucleus, and the anchor protein of the second conjugate is in the nucleus and 
wherein interaction partner A, the NLS and the first terminal of the complementation 
protein are all located on the same side of the VAD sequence and the anchor protein is 
1 0 located on the other side of the NLS; 

(b) determine if the complementation protein has complemented 
complementation being indicative that caspase has cleaved the first conjugate and the 
free part has translocated to the nucleus, and partner A and complementation partner B 
has complemented; 

15 75. A nucleic acid encoding any of the conjugates, fragments or fusions proteins 
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EGFP NtermEGFPI 57-NZ NtermEYFPI 57-NZ NtermE[F64L]YFP1 57-NZ 
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EYFP NtermEGFPI 72-NZ NtermEYFPI 72-NZ NtermE[F64L]YFP1 72-NZ 

+ CZ-CtermEGFP173 + CZ-Cterm EYFP 173 + CZ-CtermEYFP173 
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NtermEYFPI 57-NZ NtermEYFPI 72-NZ NtermEYFPI 72-NZ 

+ CZ-CtermEGFP1 58 + CZ-CtermEYFP158 + CZ-CtermEGFP173 
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NtermE[F64L]YFP1 57-NZ NtermE[F64L]YFP1 72-NZ NtermE[F64L]YFP1 72-NZ 
+ CZ-CtermEGFP1 58 + CZ-CtermEYFP1 58 + CZ-CtermEGFP1 73 
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Fig. 16 16/23 

P42212 avGFP MS KGEE LFTG WP I LVELDGDV 22 

AY015996 rmGFP MSKQILKNTCLQEVMSYKVNLEGIV 25 

AF3 72525 rrGFP MD LAKLGLKEVMPTKINLEGLV 22 

AF16 8419 dsRed MRSSKNVIKEFMRFKVRMEGTV 22 

AF322222 asCP562 MASFLKKTMPFKTTIEGTV 19 

AF168422 asCP562 MAQSKHGLTKEMTMKYRMEGCV 22 

AF24 6709 asFP595 MASFLKKTMPFKTTIEGTV 19 

AF322221 asFP499 MYP S I KETMRVQL SMEGS V 19 

AF384683 mcGFP MS VI KP IME I KLRMQG W 18 

AF401282 mfGFP MSVIKPDMKIKLRMEGAV 18 

AF168424 CSFP484 MKCKFVFCLSFLVLAITNANIFLRNEADLEEKTLRIPKALTTMGVIKPDMKIKLKMEGNV 60 

AF168420 dsFP483 MSCSKSVIKEEMLIDLHLEGTF 22 

AY015995 spGFP MNRNVLKNTGLKEIMSAKASVEGIV 25 

AF168423 ZSFP538 MAHSKHGLKEEMTMKYHMEGCV 22 

AF168421 amFP486 MALSNKF I GDDMKMT YHMDGCV 22 

AY013824 amGFPxm MSKGEELFTGI VP VL I ELDGDV 22 

P42212 avGFP NGHKFS VSGEGEGDATYGKL - -TLKFICTTG-KLPVPWPTLVTTFSYGVQCFSRYPDHMK 79 

AY015996 rmGFP NNHVFTME GCGKGN I LFGNQ - - LVQ I RVTKGAP L P F AFD I VS P AFQ YGNRTFTKYPND I S 83 

AF372525 rrGFP GDHAFSMEGVGEGNI LEGTQ - - EVKI S VTKGAPLPFAFDI VS VAFS YGNRAYTGYPEE I S 80 



AF168419 dsRED NGHEFEIEGEGEGRPYEGHN- -TVKLKVTKGGPLPFAWDILSPQFQYGSKVYVKHPADIP 80 

AF322222 asCP562 NGHYFKCTGKGEGNPFEGTQ- -EMKIEVIEGGPLPFAFHILSTSCMYGSKTFIKYVSGIP 77 

AF168422 asCP562 DGHKFVITGEGIGYPFKGKQ- -AINLCWEGGPLPFAEDILSAAFNYGNRVFTEYPQDIV 80 

AF246709 asFP595 NGHYFKCTGKGEGNPFEGTQ - - EMKIEVIEGGPLPFAFHI LSTS CMYGSKTF IKYVSGI P 77 

AF322221 asFP499 NYHAFKCTGKGEGKPYEGTQ - -SIiNITITEGGPIiPFAFDILSHAFQYGIKVFAKYPKEIP 77 

AF384683 mcGFP NGHKFVIKGEGEGKPFEGTQ- -TINLTVKEGAPLPFAYDILTSAFQYGNRVFTKYPDDIP 76 

AF401282 mfGFP NGHKFVIEGDGKGKPFEGTQ - - SMDLTVKEGAPLPFAYDILTTVFDYGNRVFAKYPQD I P 76 

AF168424 CSFP484 NGHAFVI EGEGEGKPYDGTH- -TLNLEVKEGAPLPFSYDILSNAFQYGNRALTKYPDDIA 118 

AF168420 dsFP483 NGHYFEI KGKGKGQPNEGTN- -TVTLEVTKGGPLPFGWHI LCPQFQYGNKAFVHHPDNIH 80 

AY015995 spGFP NNHVFSMEGFGKGNVLFGNQ - -LMQIRVTKGGPLPFAFDIVS IAFQYGNRTFTKYPDDIA 83 

AF168423 ZSFP538 NGHKFVITGEGIGYPFKGKQ- -TINLCVIEGGPLPFSEDILSAGFKYGDRIFTEYPQDIV 80 

AF168421 amFP486 NGHYFTVKGEGNGKPYEGTQTSTFKVTMANGGPLAFSFDILSTVFKYGNRCFTAYPTSMP 82 

AY013824 amGFPXM HGHKFSVRGEGEGDADYGKL- - EIKFI CTTG- KLPVPWPTLVTTFS YGIQCFARYPEHMK 79 

P42212 avGFP QHDFFKSAMPEGYVQERTI FFKDDGNYKTRAEV- -KFEG DTLVNRIELKGIDFKEDG 134 

AY015996 rmGFP - -DYFIQSFPAGFMYERTLRYEDGGLVEIRSDI - -NLIE DKFVYRVEYKGSNFPDDG 136 

AF372525 rrGFP — DYFLQSFPEGFTYERNIRYQDGGTAIVKSDI - - SLED GKFIVNVDFKAKDLRRMG 133 

AF168419 dsRED - -DYKiCLSFPEGFKWERVMNFEDGGVVTVTQDS - -SLQD GCFIYKVKFIGVNFPSDG 133 

AF322222 asCP562 - -DYFKQSFPEGFTWERTTTYEDGGFLTAHQDT- -SLDG D CL VYKVKI LGNNF PADG 130 

AF168422 asCP562 - - D YF KNS CPAGYTWDRS FLFEDGAVC I CNAD I TVS VEEN CMYHE S KFYGVNF PADG 135 

AF246709 asFP595 - -DYFKQSFPEGFTWERTTTYEDGGFLTAHQDT- - SLDG DC LVYKVKI LGNNF PADG 130 

AF322221 asFP499 - -DFFKQSLPGGFSWERVSTYEDGGVLSATQET- -SLQG DCI I CKVKVLGTNFPANG 130 

AF384683 mcGFP - -DYFKQTFPEGYSWERIMAYEDQSICTATSDI - -KMEG DCFIYEIQFHGVNFPPNG 129 

AF401282 mfGFP - - D YFKQTFPEGYS WERSMTYEDQGI CVATNDI - - TLMKGVDD C FVYKI RFDG VNF PANG 132 

AF168424 CSFP484 - -DYFKQSFPEGYSWERTMTFEDKGIVKVKSDI- -SMEE DSFIYEIRFDGMNFPPNG 171 

AF168420 dsFP483 - -DYLKLSFPEGYTWERSMHFEDGGLCCITNDI - - SLTG NC FYYD I KFTGLNF P PNG 133 

AY015995 spGFP - -DYFVQSFPAGFFYERNLRFEDGAIVDIRSDI- -SLED DKFHYKVE YRGNGF PSNG 136 

AF168423 ZSFP538 - -DYFKNS CP AGYTWGRS FLFEDGAVC I CNVDI TVS VKEN C IYHKS IFNGMNFPADG 135 

AF168421 amFP486 - -DYFKQAFPDGMSYERTFTYEDGGVATASWEI - -SLKGN CFEHKSTFHGVNFPADG 135 

AY013824 amGFPXM MNDFFKSAMPEGYIQERTIFFQDDGKYKTRGEV- -KFEG DTLVNRIELKGMDFKEDG 134 

P42212 avGFP NILGHKLEYNYNSHNVYIMADKQKNGIKVNFKIR1QTIEDGSVQLADHYQQN- -TPIGDGP 192 

AY015996 rmGFP P VM - QKT I LG I E PS FEAMYMN - - NGVL VGE VI LVYKLNSGKYYS CHMKTL MKSKGW 190 

AF372525 rrGFP PVM-QQDIVGMQPSYESMYTN- -VTSVIGECIIAFKLQTGKHFTYHMRTV YKSKKPV 187 

AF168419 dsRED P VM - QKKTMGWEAS TERLY PR - - DGVLKGE I HKALKLKDGGHYL VEFKS I YMAKKP - 186 

AF322222 asCP562 P RDAEQS- 137 

AF168422 asCP562 P VM - KKMTDNWE P S CEKI I PV PKQGI LKGD VSMYLLL KDGGRLRCQ FDTV YKAKS VP 191 

AF246709 asFP595 P VM - QNKAGRW E PATE I VYE V - - D G VLRGQ S LMAL KC PGGRHLT CHLHTTYRS KKP AS A - 186 

AF322221 asFP499 PVM-QKKTCGWEPSTETVI PR- -DGGLLLRDTPALMLADGGHLSCFMETT YKSKKE- 183 

AF384683 mcGFP PVM-QKKTLKWEPSTEKMYVR- -DGVLKGDVNMALLLEGGGHYRCDFRST YKAKKR- 182 

AF401282 mfGFP PVM-QKKTLKWEPSTEKMYVR- -DGVLKGDVNMALLLEGGGHYRCDFKTT YKAKKF- 185 

AF168424 CSFP484 P VM - QKKTLKWE PS TEIMYVR- - DGVLVGD I SHSLLLEGGGHYRCDFKS I YKAKKV- 224 

AF168420 dsFP483 P W - QKKTTGWE PS TERLYPR - - DGVL I GD IHHALTVEGGGHYACD I KTV YRAKKAA 187 

AY015995 spGFP P VM - QKAI LGME PS FEWYMN- - SGVLVGEVDL VYKLESGNYYS CHMKTF YRSKGGV 190 

AF168423 ZSFP538 P VM - KKMTTNWEAS CEKIMPVPKQGI LKGDVSMYLLLKDGGRYRCQFDTV YKAKS VP 191 

AF168421 amFP486 PVM-AKKTTGWDPSFEKMTVC- -DGILKGDVTAFLMLQGGGNYRCQFHTS YKTK-KP 188 

AY013824 amGFPXM NI LGHKLEYNFNSHNVYIMPDKANNGLKVNFKI RHNI EGGGVQLADHYQTN- - VPLGDGP 192 



P42212 avGFP VLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITHGMDELYK 238 
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AY015996 rmGFP KEFPSYHFIQHRLEKTYV-EDGG-FVEQHETAIAQMTSIGKPLGSLHEWV 23 8 

AF372525 rrGFP ETMPLYHFIQHRLVKTNV-DTASGYWQHETAIAAHSTIKKIEGSLP 233 

AF168419 dsRED VQL PG YYYVD S KLD I T SH - NEDYT I VE Q YERTEGRHHL FL 225 

AF322222 asCP562 RKMG ASHRDTL 148 

AF168422 asCP562 RKMPDWHFIQHKLTREDRSDAKNQKWHLTEHAIASGSALP 231 

AF246709 asFP595 LKMPGFHFEDHRIE IMEE ~ VEKGKCYKQYEAAVGRYCDAAPSKLGHN 232 

AF322221 asFP499 VKLPELHFHHLRMEKLNI-SDDWKTVEQHESWASYS-QVPSKLGHN 228 

AF384683 mcGFP VQLPDYHFVDHRIEILSH-DNDYNTVKLSEDAEARYSMLPSQAK 225 

AF401282 mfGFP VQL PD YHF VDHR I E I L S H - DKD YNKVKL YEHAE A - H S GL P RQAK 227 

AF168424 CSFP484 VKLPDYHFVDHRIEI LNH - DKDYNKVTLYENAVARYSLLPSQA 266 

AF168420 dsFP483 LKMPGYHYVDTKLVI WNN- DKEFMKVEEHE I AVARHHPFYEPKKDK 232 

AY015995 spGFP KEFPEYHFIHHRLEKTYV-EEGS-FVEQHETAIAQLTTIGKPLGSLHEWV 238 

AF168423 ZSFP538 SKMPEWHFIQHKLLREDRSDAKNQKWQLTEHAIAFPSALA 231 

AF168421 amFP486 VTMPPNHWEHRIARTDLDKGGNS-VQLTEHAVAHITSWPF 229 

AY013824 amGFPXM VL I P I NH YL S TQTA I S KDRNE TRDHMVF LE F F S ACGHTHGMDE L YK 23 8 
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SEQUENCE LISTING 

<110> Biolmage A/S 

<12 0> TRANSLOCATION DEPENDENT COMPLEMENTATION FOR DRUG SCREENING 
<130> 1017PC1 
<160> 77 

<170> Patentln version 3.1 

<210> 1 

<211> 238 

<212> PRT 

<213> Aequorea victoria 

<400> 1 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
1 5 10 15 



Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 



Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
3 5 40 45 



Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 



Ser Tyr Gly Val Gin Cys Phe Ser-Arg Tyr Pro Asp His Met Lys Gin 
65 70 75 80 



His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 
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Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 



Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
115 120 125 



Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 



Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 



He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
165 170 175 



Gin Leu Ala Asp His Tyr' Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
180 185 190 



Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 



Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 



Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 2 
<211> 238 
<212> PRT 

<213> Aequorea victoria 
<400> 2 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 
15 10 15 



Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 



Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
35 40 45 



WO 03/089627 1 J PCT/DK03/00266 

3 



Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 



Ser Trp Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
65 70 75 80 



His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 



Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 lib 



Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 
115 120 125 



Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 



Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 



He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
165 170 175 



Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
.180 185 190 



Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 



Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 



Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 3 

<211> 238 

<212> PRT 

<213> Aequorea victoria 
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<400> 3 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
15 10 15 



Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 



Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
35 40 45 



Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 



Ser His Gly Val Gin Cys Phe Ser Arg Tyr Pro' Asp His Met Lys Gin 
65 70 75 80 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

85 90 95 



Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 



Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 
115 120 125 



Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 



Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 150 155 160 



He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 
165 170 175 



Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro 
180 185 ^ 190 



Val Leu Leu Pro Asp Asn. His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 



Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 



WO 03/089627 PCT/DK03/00266 



Thr Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 4 

<211> 239 

<212> PRT 

<213> Aequorea victoria 

<400> 4 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 



Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 



Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 



Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 



Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 



He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 



Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 



Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 



WO 03/089627 §*% W PCT/BK03/00266 

6 



Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 



Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 



Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 



Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 5 
<211> 239 
<212> PRT 

<213> Aequorea victoria 
<400> 5 . 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 



Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 



Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys; Leu Thr Leu Lys Phe He 
35 40 45 



Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 ' 95 



Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 



Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 



WO 03/089627 PCT/DK03/00266 



7 

115 120 125 



lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 

130 135 • 140 



Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 



Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu. Asp Gly Ser 
165 170 " 175 



Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 



Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 



Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 



Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 6 
<211> 239 
<212> PRT 

<213> Aequorea victoria 
<400> 6 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 



Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 



Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 



Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



WO 03/089621 |gf§| ggg$ PCT/DKOJ/OOW. 

8 

Leu Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 



Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 



Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 12 5 



lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 



Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 ' 160 



Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 



Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 



Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 



Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210- 215 220 



Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



<210> 7 

<211> 121 

<212> DNA 

<213> Human 



<220> 

<221> CDS 



WO 03/089627 




PCT7DK03/00266 



<222> (3) . . (116) 
<223> 



<400> 7 

cc atg gcc ggt ggt acc ggt tec ggt gec ctg aag aag gag ctg cag 47 

Met Ala Gly Gly Thr Gly Ser Gly Ala Leu Lys Lys Glu Leu Gin 
15 10 15 

gcc aac aag aag gag ctg gcc cag ctg aag tgg gag ctg cag gcc ctg 95 
Ala Asn Lys Lys Glu Leu Ala Gin Leu Lys Trp Glu Leu Gin Ala Leu 
20 25 30 

aag aag gag ctg gcc cag tag gatcc 121 
Lys Lys Glu Leu Ala Gin 
35 



<210> 8 

<211> 37 

<212> PRT 

<213> Human 



<400> 8 

Met Ala Gly Gly Thr Gly Ser Gly Ala Leu Lys Lys Glu Leu Gin Ala 
15 10 15 



Asn Lys Lys Glu Leu Ala Gin Leu Lys Trp Glu Leu Gin Ala Leu Lys 
20 25 30 



Lys Glu Leu Ala Gin 
35 



<210> 9 

<211> 121 

<212> DNA 

<213> Human 



<220> 



<221> 



CDS 



4.7 

WO 03/089627 . ' ' fsJf PCT/DKQ3/00266 
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<222> (3) . . (116) 
<223> 



<400> 9 

cc atg gcc age gag cag ctg gag aag aag ctg cag gec ctg gag aag 47 
Met Ala Ser Glu Gin Leu Glu Lys Lys Leu Gin Ala Leu Glu Lys 
15 10 15 

aag ctg gcc cag ctg gag tgg aag aac cag gcc ctg gag aag aag ctg 95 
Lys Leu Ala Gin Leu Glu Trp Lys Asn Gin Ala Leu Glu Lys Lys Leu 
20 25 30 

gcc cag ggc ggc acc ggt tag gatcc 121 
Ala Gin Gly Gly Thr Gly 

35 ... 



<210> 10 

<211> 37 

<212> PRT 

<213> Human 

<400> 10 

Met Ala Ser Glu Gin Leu Glu Lys Lys Leu Gin Ala Leu Glu Lys Lys 
1 5 .10 15 



Leu Ala Gin Leu Glu Trp Lys Asn Gin Ala Leu Glu Lys Lys Leu Ala 
20 25 30 



Gin Gly Gly Thr Gly 
35 



<210> 11 

<211> 19 

<212> PRT 

<213 > Human 



<400> 11 



Gly Ser Gly Ser Gly Ser Gly Asp He Thr Ser Leu Tyr Lys Lys Ala 
15 10 15 



WO 03/089627 



PCT/DK03/00266 



11 



Gly Ser Thr 

<210> 12 

<211> 19 

<212> PRT 

<213> Human 

<400> 12 

Gly Ser Gly Ser Gly Ser Gly Asp lie Thr Ser Leu Tyr Lys Lys Ala 
1.5 10 15 

Gly Ser Thr 

<210> 13 

<211> 18 

<212> PRT 

<213> human 

<400> 13 

Asp Pro Ala Phe Leu Tyr Lys Val Val lie Ser Gly Ser Gly Ser Gly 



Ser Gly 

<210> 14 

<211> 18 

<212> PRT 

<213> Human 

<400> 14 



1 



5 



10 



15 



Asp Pro Ala Phe Leu Tyr Lys Val Val lie Ser Gly Ser Gly Ser Gly 
1 5 10 15 



WO 03/089627 




PCT/DK03/00266 



Ser Gly 



<210> .15 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer sequence 
<220> 

<221> mis c_f e a t ur e 

<223> Primer sequence 



<400> 15 

cctactgctt tgagattcgt egg 2 3 

<210> 16 

<211> 22 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer sequence 
<220> 

<221> mis cofeature 

<223> Primer sequence 



<400> 16 

gtcattccag ttttagaagc tc 



22 



WO 03/089627 HH PCTO>K03.'0026f, 

13 

<210> 17 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

<221> misc^feature 
<223> Primer sequence 



<400> 17 

cagacaatct gtgtgggcac tcgaccgg 



<210> 18 

<211> 47 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

< 2 2 1 > mi s c_f ea tur e 

<223> Primer sequence 



<400> 18 

catggccggt ggtaccggtt ccggtgccct gaagaaggag ctgcagg 



<210> 19 
<211> 38 



<212> DNA 



WO 03/089627 




PCT/DK03/00266 



14 



<213> Artificial Sequence 

<220> 

<223> Primer sequence 
<220> 

< 2 2 1 > mi s c_f ea tur e 
<223> Primer sequence 

<400> 19 

agctccttct tcagggcacc ggaaccggta ccaccggc 

<210> 20 

<211> 41 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

< 2 2 1 > mi s c_f ea t ur e 
<223> Primer sequence 

<400> 20 

ccaacaagaa ggagctggcc cagctgaagt gggagctgca g 

<210> 21 

<211> 40 

<212> DNA 

<213> Artificial Sequence 



<220> 



WO 03/089627 f§P PCT/DK03/00266 

15 

<223> Primer sequence 
<220> 

<221> misc_feature 
<223> Primer sequence 

<400> 21 

ctcccacttc agctgggcca gctccttctt gttggcctgc 

<210> 22 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

<221> misc_feature 

<223> Primer sequence 



<400> 22 

gccctgaaga aggagctggc ccagtag 

<210> 23 

<211> 37 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer sequence 
<220> 

< 2 2 1 > mi s cofeature 
<223> Primer sequence 



WO 03/089627 

16 



<400> 23 

gatcctactg ggccagctcc ttcttcaggg cctgcag 



<210> 24 

<211> 40 

<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> Primer sequence 
<220> 

< 2 2 1 > mis c_feature 

<22 3> Primer sequence 

<400> 24 

catggccagc gagcagctgg agaagaagct gcaggccctg 

<210> 25 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

<221> mis cofeature 

<223> Primer sequence 



<400> 25 

cctgcagctt cttctccagc tgctcgctgg c 



WO 03/089627 §M? P^XT>K'J3;0«>266 

17 

<210> 26 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

<221> mis cofeature 
<223> Primer sequence 



<400> 26 

gagaagaagc tggcccagct ggagtggaag aaccaggccc tggag 



<210> 27 

<211> 44 

<212> DNA • 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

<2 2 1 > mi s cofeature 

<223> Primer sequence 



<400> 27 

ggcctggttc ttccactcca gctgggccag cttcttctcc aggg 



<210> 28 
<211> 30 
<212> DNA 



WO 03/089627 

<213> Artificial Sequence 



18 



<220> 

<223> Primer sequence 
<220> 

< 2 2 1 > mi s c_f ea tur e 

<223> Primer sequence 



<400> 28 

aagaagctgg cccagggcgg caccggttag 



<210> 29 

<211> 40 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

<221> mis cofeature 

<223> Primer sequence 

<400> 29 

gatcctaacc ggtgccgccc tgggccagct tcttctccag 

<210> 30 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<:220> 

<223> Primer sequence 



WO 03/089627 



PCT/DK03/00266 



19 

<220> 

<221> misc_f eature 
<223> Primer sequence 

<400> 30 

ggcgccatgg tgagcaaggg cgag 

<210> 31 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Primer sequence 
<220> 

<221> misc_f eature 

<223> Primer sequence 

<400> 31 

gccggaccgg taccaccgtt gtactccagc ttgtg 

<210> 32 

<211> 33 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer sequence 
<220> 

<221> misc_f eature 

<223> Primer sequence 



WO 03/089627 




PCT/DK03/00266 



<400> 32 

gccggaccgg taccaccctg cttgtcggcc atg 

<210> 33 

<211> 36 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer sequence 
<220> 

< 2 2 1 > mis cofeature 

<223> Primer sequence 

<400> 33 

gccggaccgg taccaccctc gatgttgtgg cggatc 

<210> 34 

<211> 32 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Primer sequence 
<220> 

< 2 2 1 > mi s c_f e a t ur e 

<223> Primer sequence 



<400> 34 

ccccggatcc tacttgtaca gctcgtccat gc 



WO 03/089627 



21 



<210> 35 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Primer sequence 
<220> 

<221> misc_f eature 

<2 2 3 > Primer sequence 



<400> 35 

ggcgccatgg gcaccggtta caacagccac aacgtc 



<210> 36 

<211> 36 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer sequence 
<220> 

< 2 2 1 > mi s cofeature 
<223> Primer sequence 



<400> 36 

ggcgccatgg gcaccggtaa gaacggcatc aaggtg 

<210> 37 
<211> 36 
<212> DNA 



WO 03/089627 ' J PCT/DK03/00266 

22 

<2I3> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

<221> misc_f eature 
<223> Primer sequence 



<400> 37 

ggcgccatgg gcaccggtga cggcagcgtg cagctc 



<210> 38 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

< 2 2 1 > mis c_f e a tur e 

<223> Primer sequence 



<400> 38 

gggggctagc gccaccatgg tgagcaaggg cgag 



<210> 39 

<211> 57 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer sequence 



WO 03/089627 




PCT/DK03/00266 



<220> 

- <221> misc_f eature 

<223> Primer sequence 

<400> 39 

gcgggggatc cgatatcgcc agagccagag ccagagccct cgatgttgtg gcggatc 

<210> 40 

<211> 56 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

<2 2 1 > mi s c_f eature 

<223> Primer sequence 

<400> 40 

gggggctagc gatatccggc tctggctctg gctctggcga cggcagcgtg cagctc 

<210> 41 

<211> 64 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

<22l> misc_f eature 

<223> Primer sequence 



WO 03/089627 f PCT/DK03/00266 
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<400> 41 

gcccaccctc gtgaccacct tcggctacgg cctgcagtgc ttcgcccgct accccgacca 
catg 

<210> 42 
<211> 64 
<212> DNA. 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

<221> misc_f eature 
<223> Primer sequence 



<400> 42 

catgtggtcg gggtagcggg cgaagcactg caggccgtag ccgaaggtgg tcacgagggt 

gggc 



<210> 43 

<21.1> 64 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

< 2 2 1 > mi s c_f e a tur e 

<223> Primer sequence 



<400> 43 



WO 03/089627 




PCT/DK03/00266 



25 

gcccaccctc gtgaccaccc tgggctacgg cctgcagtgc ttcgcccgct accccgacca 
catg 

<210> 44 
<211> 64 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Primer sequence 
<220> 

<221> misc_f eature 
<22 3> Primer sequence 

<400> 44 

catgtggtcg gggtagcggg cgaagcactg caggccgtag cccagggtgg tcacgagggt 
gggc 

<210> 45 

<211> 36 

<212> DNA 

<213> Artificial Sequence 



<220> 

< 2 2 3 > Prime r s equenc e 
<220> 

< 2 2 1 > mi s c_f e a t ur e 
<223> Primer sequence 



<400> 45 

gacaaccact acctgagcta ccagtccgcc ctgagc 



WO 03/089627 



PCT/DK03/00266 



26 



<210> 46 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

< 2 2 1 > mi s cofeature 

<223> Primer sequence 



<400> 46 

gctcagggcg gactggtagc tcaggtagtg gttgtc 



<210> 47 

<211> .26 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer sequence 
<220> 

<221> misc_f eature 

<223> Primer sequence 



<400> 47 

ccaccatggg agtgcaggtg gaaacc 

<210> 48 
<211> 19 
<212> DNA 



WO 03/089627 



PCT/DK03/00266 



27 



<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

<221> misc_f eature 

<223> Primer sequence 

<400> 48 

cttccagttt tagaagctc 19 

<210> 49 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

<221> mis cofeature 

<223> Primer sequence 



<210> 50 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 



<400> 49 

ccaccatgga gatgtggcat gaaggcctg 



29 



\ 



WO 03/089627 



PCT/DK03/00266 



28 



<220> 



<221> 



misc feature 



<223> 



Primer sequence 



<400> 50 

cctgctttga gattcgtcgg aacac 



25 



<210> 51 

<211> 74 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

<221> misc_feature 

<22 3> Primer sequence 

<400> 51 

tcctaggtca gtcctgctcc tcggccacga agtgcactcc taggctgcag cacgtgttga 60 
caattaatca tcgg 74 

<210> 52 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

<221> misc feature 



WO 03/089627 





PCT/DK03/00266 



29 



<223> Primer sequence 



<400> 52 

cagacaatct gtgtgggcac tcgaccgg 



28 



<210> 



53 



<211> 



59 



<212> 



DNA 



<213> 



Artificial Sequence 



<220> 



<223> Primer sequence 
<220> 

<221> misc__f eature 

<223> Primer sequence 

<400> 53 

gcgggggatc cgatatcgcc agagccagag ccagagccct gggccagctc cttcttcag 59 

<210> 54 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer sequence 
<220> 

<221> misc_f eature 

<223> Primer sequence 



<400> 54 

cctacttgga gctggtgtac ttggtgac 



28 



WO 03/089627 




PCT/DK03/00266 



<210> 55 

<211> 56 

<212> DNA 

<213> Artificial sequence 
<220> 

<223>" Primer sequence 

<400> 55 

gggggctagc gatatccggc tctggctctg gctctggcga cggcagcgtg cagctc 

<210> 56 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer sequence 
<220> 

<221> misc^feature 

<223> Primer sequence 

<400> 56 

ccaccatggc tactcaagct gatttgatg 

<210> 57 

<211> 25 

<212> DNA 

<213> Artificial sequence 



<220> 
<223> 



Primer sequence 



WO 03/089627 





PCT7DK03/00266 



<220> 



<221> misc feature 



<223> Primer sequence 



<400> 57 

ccaggtcagt atcaaaccag gccag 



25 



<210> 58 

<211> 28 

<212> DNA 

<2l3> Artificial. sequence 
<220> 

<223> Primer sequence 
<220> 

<221> misc_feature 

<223> Primer sequence 

<400> 58 

cctacaggtc agtatcaaac caggccag 2 8 

<210> 59 

<211> 36 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer sequence 
<220> 

< 2 2 1 > mi s cofeature 

<223> Primer sequence 



WO 03/089627 




PCT7DK03/00266 



<400> 59 

gttgctagca ccatgcctga gccagccaag tctgct 36 

<210> 60 

<211> 34 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer sequence 
<220> 

<221> misc_ f eature 

<223> Primer sequence 



<400> 60 

gttgatatcc cttggagctg gtgtacttgg tgac 34 

<210> 61 
<211> 29 
<212> DNA 



<213> Artificial sequence 



<220> 

<223> Primer sequence 
<220> 

< 2 2 1 > mi s c_f eature 

<223> Primer sequence 



<400> 61 

ccaccatgcc gcagctgaac ggcggtgga 



29 



WO 03/089627 f^J| PCT/liKGJ/OO^o 

33 

<210> 62 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer sequence 
<220> 

<221> misc_f eature 

<223> Primer sequence 

<400> 62 

caggcggagg ccgtctttcc gc 

<210> 63 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<220> 

< 2 2 1 > mi s cofeature 

<223> Primer sequence 



<400> 63 

gttcagctga tgggagtgca ggtggaaacc atctcc 

<210> 64 
<211> 60 
<212> DNA 



WO 03/089627 





PCT7DK03/00266 



34 



<213> 



Artificial sequence 



<220> 



<223> 



Primer sequence 



<220> 



<221> 



misc feature 



<223> 



Primer sequence 



<400> 64 

gttggatccg atatcaccgg tgcctgtccc agttccttcc agttttagaa gctccacatc 



60 



<210> 65 

<211> 62' 

<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> Primer sequence 
<220> 

<221> misc_feature 

<223> Primer Sequence 

<400> 65 

gttgctagcg atatccggaa cgggcactgg gaccggtgag atgtggcatg aaggcctgga 60 
ag 62 

<210> 66 

<211> 35 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Primer sequence 
<220> 

<221> mis cofeature 
<223> Primer Sequence 

<400> 66 

gttcccggga gctgctttga gattcgtcgg aacac 

<210> 67 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer sequence 
<220> 

<221> misc_feature 

<223> Primer sequence 

<400> 67 

ccaccatgcc tgagccagcc aagtctgct 

<210> 68 

<211> 25 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Primer sequence . 
<220> 
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<221> misc_f eature 
<223> Primer sequence 



<400> 68 

ccttggagct ggtgtacttg gtgac 

<210> 69 

<211> 18 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Linker Sequence 

<400> 69 

Asp Pro Ala Phe Leu Tyr Lys Val Val lie Ser Gly Ser Gly Ser Gly 
1 5 10 15 

Ser Gly 



<210> 70 

<211> 19 

<212> PRT 

<213> Artificial sequence 



<220> 

<223> Linker sequence 
<400> 70 

Gly Ser Gly Ser Gly Ser Gly Asp lie Thr Ser Leu Tyr Lys Lys Ala 
1 5 10 15 

Gly Ser Thr 
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<210> 71 

<211> 16 

<212> PRT 

<213> Artificial sequence 



<220> 

<223> Linker sequence 
<400> 71 

Asp Pro Ala Phe Leu Tyr Lys Val Val lie Ser Gly Ser Gly Ser Gly 
15 10 15 

<210> 72 

<211> 9 

<212> PRT 

<213> Artificial sequence 



<220> 

<22 3> Linker sequence 

<400> 72 

Gly Ser Gly Ser Gly Ser Gly Asp Leu 
1 5 

<210> 73 

<211> 18 

<212> PRT 

<213> Artificial sequence 



<220> 

<223> Linker sequence 
<400> 73 

Asp Pro Ala Phe Leu Tyr Lys Val Val He Ser Gly Thr Gly Thr Gly 
1 5 10 15 
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Thr Gly 



<210> 74 

<211> 19 

<212> PRT 

<213> Artificial sequence 
<220> 

<22 3> Linker sequence 

<400> 74 

Gly Ser Gly Ser Gly Ser Gly Asp lie Thr Ser Leu Tyr Lys Lys Ala 
1 5 10 15 

Gly Ser Thr 

<210> 75 
<211> 12 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Linker sequence 
<400> 75 

Asp He Thr Ser Leu Tyr Lys Lys Ala Gly Ser Thr 
15 10 

<210> 76 

<211> 19 

<212> PRT 

<213> Artificial sequence 



WO 03/089627 




PCT/DK03/00266 



<220> 

<223> Linker sequence 
<400> 76 

Gly Thr Gly Thr Gly Thr Gly Asp lie Thr Ser Leu Tyr Lys Lys Ala 
1 5 10 15 

Gly Ser Thr 

<210> 77 

<211> 10 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Linker sequence 

<400> 77 

Leu Pro Ser Gly Ser Gly Ser Gly Ser Gly 
1 5 10 
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